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Positive Time Preference 





Mancur Olson and Martin J. Bailey 


University of Maryland 


The case for positive time preference is absolutely compelling, unless 
there is an infinite time horizon with the expectation of unending 
technological advance combined with what we call “drastically di- 
minishing marginal utility.” This finding holds both in the positive 
and normative senses. A corollary is that savings are interest elastic. 


Most writers on capital theory have taken it for granted that most 
people systematically prefer present to future consumption. This 
positive rate of time preference is usually ascribed to a “faulty tele- 
scopic faculty,” to “myopia,” or to “impatience,” even though this 
implies an irrationality or continuing perceptual bias that is inconsis- 
tent with the behavioral assumptions economists usually use. To our 
knowledge no one has ever provided convincing evidence that there is 
in fact normally positive time preference, or even specified an empiri- 
cal test capable of determining whether there is or not. Apparently 
the belief in myopia often rests on subjective judgments about the 
relative utility levels of other individuals across different time periods 
that have very limited scientific value. In other cases a preference for 
present over future consumption is erroneously supported with evi- 
dence that, as we shall argue, need reflect nothing more than a 
rational adaptation to positive interest rates generated by the fact that 
capital goods can be useful in production. In still other instances time 
preference is supposed to follow directly from the fact that there is 


We are indebted to Gary Becker, Christopher Clague, Donald Dewey, Robert 
Dorfman, Tjalling Koopmans, Peter Murrell, George Stigler, Paul Wonnacott, and 
Howell Zee for helpful comments and suggestions. Part of the work on this article was 
supported by grants from the Environmental Protection Agency, the National Science 
Foundation, and Resources for the Future. 
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always uncertainty about the future, so the consumer cannot be sure 
whether any assets that he acquires by saving will retain their value or 
even that he will be alive to enjoy any fruits of his saving. In fact, 
uncertainty about future economic conditions and the duration of 
one’s life can increase as well as decrease the incentive to save. 

This paper develops an approach to intertemporal choice that 
makes it possible to determine from actual household choices in 
certain circumstances whether or not there is positive time prefer- 
ence. The empirical tests on ordinary saving and consumption be- 
havior that grow out of this approach suggest that a-decidedly positive 
rate of time preference is typical if not almost universal. There is an 
alternative interpretation of the evidence that permits doubts in cases 
where the planning horizon is infinite, and this we label “drastically 
diminishing marginal utility.” This intriguing phenomenon could in 
principle be widespread. In long-run contexts it is so similar to time 
preference in its implications that the two are difficult to distinguish. 
Nonetheless, especially when short- as well as long-run contexts are 
considered, the case that there is often time preference becomes 
compelling, even if drastically diminishing marginal utility should also 
be present. 

We shall also show that those who advocate certain environmental 
and resource policies on the ground that the interests of all future 
years and generations ought to be weighted equally with our concern 
for utility in the present are contradicted by their own behavior. We 
will similarly show that it is logically inconsistent to argue that there is 
positive time preference and that household savings do not generally 
respond to changes in interest rates. 


I. Positive Time Preference and Diminishing Marginal Utility 


Often the idea of time preference appears in an impressionistic form 
without any workable definitions (see, e.g., Pigou 1960; Jevons 1965), 
though there are also a number of quite useful efforts to obtain a 
precise understanding of the matter.! Interestingly enough, the 
clearest conception of positive time preference that we have been able 
to find was in BOhm-Bawerk’s original account: 


Present goods have in general greater subjective value than future 
(and mtermediate) goods of equal quantity and quality... . 
A first principal cause capable of producing a difference in 


"See the helpful discussion in Marglin (1963), Feldstein (1964), and Hirshleifer 
(1970). There are also important articles that appeared somewhat earlier, notably 
Samuelson (1937), Strotz (1956), Debreu (1959), and Koopmans (1960). 
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value between present and future goods is . . . if a person 
suffers in the present from appreciable lack of certain goods, 
or of goods in general, but has reason to hope to be more 
generously provided for at a future time, then that person 
will always place a higher value on a given quantity of imme- 
diately available goods than on the same quantity of future 
goods. This situation occurs with very great frequency in our 
economic life... . 

We must now consider a second phenomenon of human 
experience—one that is heavily fraught with consequence. 
That is the fact that we feel less concerned about future 
sensations of joy and sorrow simply because they do lie in the 
future, and the lessening of our concern is in proportion to 
the remoteness of that future. Consequently we accord to 
goods which are intended to serve future ends a value which 
falls short of the true intensity of their future marginal 
utility. [1959, pp. 265-73] 


This quotation suggests two distinct elements, each of which might 
influence intertemporal choice: (1) diminishing marginal utility of 
consumption at any given time, where utility is an unchanging func- 
tion of the amount of consumption at the time, and (2) discounting of 
future versus present utility. This distinction is so fundamental that 
we believe there can be no adequate explanation of intertemporal 
choice without it, yet it is too often ignored. Each of these two ele- 
ments, moreover, naturally suggests a testable hypothesis that can be 
proven false. Bailey, Olson, and Wonnacott have shown elsewhere 
that “The Marginal Utility of Income Does Not Increase” (1980) and 
in at least a great majority of cases is actually decreasing. In this paper 
we accordingly accept Bohm-Bawerk’s assumption that there is di- 
minishing marginal utility to consumption in each period and go on to 
explore whether the available evidence is consistent with the hypothe- 
sis that utility in the future is valued less than utility in the present. In 
order to do this, and to obtain the full range of hypotheses that can be 
drawn with BOhm-Bawerk’s distinction, we shall first develop a more 
formal statement of the distinction. 

Following a suggestion by Hirshleifer (1970),? we represent utility 
at moment ¢ by a function v(C;,); the consumption C; is time sub- 
scripted, but the function is time invariant, depending only on the 
level or quantity of C;, regardless of ¢. Then with time measured in 
terms of discrete periods of equal.length, we write the consumer’s 


2 The Hirshleifer analysis in turn has precedents in the articles by Samuelson (1937), 
Strotz (1956), Debreu (1959), and Koopmans (1960) cited in the preceding footnote. 
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utility function to be maximized at time zero, subject to a wealth 
budget constraint, as 
T 

U= Dea o 

otha”) 

where the consumer’s time horizon for planning is 7, which need not 
be finite, and where 7 is a constant, the rate per period of positive 
time preference. We assume that the first derivative v’ is positive and 
(because of the finding mentioned before) that the second is negative, 
uae ()) 

Now consider a household choosing among alternative combina- 
tions (Co, C,) of consumption at ¢ = 0 and ¢ = 1, holding its consump- 
tion pattern for all later periods fixed. If we take the differential of 
(1), setting dC, = 0 for all ¢ = 2, we have 





dU = v'(C,)dC, + BC Ges (2) 
ay 

and setting dU = 0 to represent holding the household indifferent, 

LC iC ee 

17 
or 
dC, v'(Co) 
= a Nol! (Ie 
ic ee OCHS alee : 


The household will demand an increase in C, greater than the reduc- 
tion in Co, that is, 








Cyl 
to hold utility constant, if either of two things is true: 
C, > Cy with yn = 0, (4) 
or 
He with C, = Co. (5) 


Conditions (4) and (5) succinctly state the two causes of interest in the 
quoted passage of BOhm-Bawerk. 

Inequality (4) follows because v" < 0. Diminishing marginal utility 
of income implies that, if all else is equal, a household with greater 
consumption potential in the later period will be willing to borrow at 
a positive interest rate in order to make consumption in the two 
periods more nearly equal. This is the “first cause” of the preference 
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for present over future consumption mentioned in the quotation 
from Bohm-Bawerk. 

The hypothesized second cause is represented by the constant 7 in 
the intertemporal utility function (1). When it is positive rather than 
zero as in (5), the household would be willing to pay some positive 
interest rate to increase consumption in the initial period at the 
expense of consumption in the later period, even with equal endow- 
ments in the two periods. 

By tracing out the combinations of Cy and C, to which the house- 
hold is indifferent, using (1) or (3), assuming v(C;) and 7 are known, 
one can obtain an indifference curve between present and next- 
period consumption. That is, equation (1) is an implicit function of Co 
and C,, when all other C; are held constant and when U is also held 
constant; this function, 


v(C,) 


Up») =v(Co) + 1+7 


+ const., 
is the equation of an indifference curve in (Co, C,) space. Such an 
indifference curve U, is shown in figure 1. Along the 45° line Cy = C,, 
and on this line any tangent to an indifference curve, such as ABD, has 
a slope of —(1 + 7). This can be seen by setting Cy = C, in equation 
(3), or by noting that when consumption is the same in both periods, 
any preference for present over future consumption must be due to 
a 

Though the marginal rate of substitution between consumption in 
any two periods for a household in equilibrium is frequently called 
the rate of “time preference,” which is of course equal to the interest 
rate, this casual usage has caused a lot of confusion. We therefore 
strongly suggest that “positive time preference” be defined to exclude 
the effect of a difference in marginal utility due to any lower level of 
consumption in the present and, therefore, to include only any pref- 
erence for present over future consumption due to other causes. In 
our formulation, time preference is then defined as a positive value of 
the constant 7, rather than in terms of the entire expression (3), which 
gives the slope of the indifference curve. The ratio of marginal 
utilities, [v’(Cy)]/[v'(C,)], depends not on time but on levels of con- 
sumption in the two periods; any preference for present over future 
utility, by contrast, can be conveniently represented by the subjective 
discount rate 7. 


II. Separable Utility 


Before we can properly derive our positive results, we need to con- 
sider whether the view of intertemporal choice set out in equation (1) 
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is appropriate. In particular, we need to consider whether it is neces- 
sary or expedient to write utility as a function of consumption in every 
period: 


U = U(Co, Ci, C., Sheen sai Cr). (6) 


This formulation, unlike that in equation (1), would formally encom- 
pass any complementarity between the levels of consumption in dif- 
ferent periods. Indeed, one could go further and write utility as a 
function of the amount of each separate good consumed in each 
period—as would be necessary, for example, if the typical household 
consumed a little cream today to go with the coffee it will consume a 
week from Monday. In other words, the utility derived from a given 
level and pattern of consumption in a particular period could con- 
ceivably depend not only on the level but also on the composition of 
consumption in prior periods. There are examples of interdepen- 
dence of consumption on different dates, such as a preference to have 
different dinner menus on successive days, and it could be necessary 
for some special purposes to take these into account. However, as 
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Samuelson (1937) has argued, functions that allow unlimited inter- 
relationships become so general as to be almost vacuous. If a utility 
function rules nothing out, it will be consistent with almost any possi- 
ble type of behavior and thus generate no potentially falsifiable 
hypotheses of interest. In addition, it has been shown elsewhere 
(Bailey and Olson 1977; Bailey et al. 1980) that the assumption of 
separability is more realistic than is usually supposed. Finally, it is 
hard to imagine how any plausible type of temporal complementarity 
could generate results that are qualitatively different from those in 
this article, with one obvious exception we can handle now. 

An individual cannot, of course, derive utility from consumption in 
any period unless he has survived prior periods and also preserved 
the health needed to enjoy consumption. This could be accommo- 
dated formally by treating the essentials needed to preserve life and 
health as crucial investments in human capital. This would not really 
solve the problem, though, since the temporal interdependence 
would still have to be taken into account in investment decisions. 
Similarly, if a family is to have grandchildren or more distant descen- 
dants, then each intermediate generation must have at least enough 
resources to survive long enough to raise children. Thus we em- 
phasize that our analysis must be modified before it can be applied in 
any situations where levels of consumption too low for healthful 
viability or reproduction are at issue. Abstractly, we deal with this 
starvation level of consumption oC by supposing that v'(C) > ~ as OS 
oC* and that v(C) is undefined for C < ,C. Given this qualification, we 
can safely continue to use function (1). 


Ili. Erroneous Inference of Myopic Behavior 


In Section I we distinguished time preference from the effects of 
diminishing marginal utility of consumption in each period. With the 
aid of this distinction we can clear up one elementary yet surprisingly 
common error. Since Bohm-Bawerk’s time, if not before, some 
economists have assumed that positive interest rates indicate that most 
people have positive time preference. Though we shall later provide 
evidence that time preference is in fact positive, this finding by no 
means follows from the existence of interest. 

This is obvious once one considers the implications of the fact that 
capital goods can be useful in production. The existence of profitable 
investment opportunities implies that reduction of consumption by a 
dollar now will allow more than a dollar’s increase in consumption 1n 
the future. A glance at equation (1) reminds us that even if time 
preference (7) were zero, this fact can generate positive interest rates, 
because diminishing marginal utility of consumption in each period 
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entails that consumers will accept a lower level of consumption in one 
period than another only if they are rewarded for doing so. Obvi- 
ously, positive interest rates and the excess of marginal utility of 
present over that of future consumption that they induce? does not by 
itself justify the conclusion that there is any preference for present 
over future utility, much less persistent myopia or irrationality. 

When one looks at both the demand and supply sides, it becomes 
clear that each of the following combinations of conditions is 
sufficient to generate the customary positive interest rates: (1) di- 
minishing marginal utility plus profitable investment opportunities 
with nondecreasing income (and zero time preference); (2) di- 
minishing marginal utility with increasing endowment income (and 
zero time preference); (3) positive time preference with constant 
income or with profitable investment opportunities (and nonincreas- 
ing marginal utility). Therefore, the fact that interest rates are posi- 
tive does not by itself tell us which causes are operative. Only evidence 
of a different kind could demonstrate that there is positive time 
preference. 

The rudimentary considerations just mentioned are, however, 
sufficient to tell us something about the relationship of 1, whether it 
be positive, negative, or zero, and the prevailing interest rate. Con- 
sider initially a manna-from-heaven world with no producers. Such a 
world could have a consumption-loans market with a determinate rate 
of interest in long-run equilibrium. Since we have not specified the 
times at which manna falls from heaven, or yet offered any evidence 
that would either entail or rule out positive time preference, the 
interest rate in this consumption-loans market could be anything— 
even zero or negative. If the manna falls evenly over time, the interest 
rate in this consumption-loans market is bound to be equal to 7. 

But now suppose that advantageous production processes are dis- 
covered in which produced capital is an argument in the production 
function for consumption goods. Since this discovery will increase the 
demand for borrowings but not shift the supply function for loans, 
the interest rate must rise. The result is that in equilibrium r > 7. 
Except for a few extreme cases, the real world will be one in which r > 
7, because in almost all real-world cases capital goods have a positive 
net marginal product. The marginal rate of substitution between 
present and future consumption in long-run equilibrium will of 


* The difficulties involved in any effort to get one unambiguous measure of the 
capital stock and the extended controversies that have grown out of this should not, of 
course, obscure the fact that capital accumulation requires postponement of consump- 
tion. Bohm-Bawerk’s writings, despite their well-known shortcomings, are sufficient to 
show this (see Dorfman 1959a, 1959). Thus the presence of “reswitching,” e.g., does 
not undermine our analysis here. 
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course always equal the rate of interest, but 7 is not in general equal to 
this marginal rate of substitution, even if 7 is positive. 


IV. Is Time Preference Observable? 


It is intuitively obvious from the discussion of figure 1 and from what 
has just been said that we can ascertain an individual's rate of time 
preference by observing the interest rate at which he chooses a level 
consumption stream. This means that we have a firm and simple basis 
for using a revealed preference approach to obtain solid scientific 
evidence about time preference and have no need to appeal to any- 
thing resembling introspection or interpersonal comparisons of util- 
ity.* 

The conclusion that the household chooses a level consumption 
stream if the rate of interest equals the rate of pure time preference 
clearly holds for the multiperiod as well as for the two-period cases 
(see Samuelson 1937). This finding is itself enough to make time 
preference observable. We noted earlier that, unless incomes are 
declining, the existence of productive investment opportunities im- 
plies that the equilibrium rate of interest will be greater than the rate 
of time preference, and this is true however high the rate of time 
preference might be. Therefore we would predict that in most cases 
households would have positive levels of savings that generate a rising 
level of consumption over time. This theoretical prediction appears to 
be relatively realistic. 

There are also other ways to measure time preference from re- 
vealed preferences. If the household maximizes utility (1) subject to 
the budget constraint 


Vee Oy 
2 asy % @) 


varying each C;, to obtain this maximum, the result implies 


Oy it ray 
aC) (+i¥ . 


4 In seeking revealed preference we are, of course, only continuing a line of inquiry 
started by others. In 1937 Samuelson proposed a hypothetical method for obtaining the 
utility function by integration, i.e., by mapping observed choices into a functional 
representation; in effect this was applying the revealed-preference approach to our 
problem. Similarly, Debreu (1959) showed, with a rigorous proof, that a “pre- 
ordering,’ ora consistent, transitive set of preferences, implies a utility function that is 
unique up to a linear transformation. Frisch’s (1959) analysis also proceeded in a similar 
spirit. There are also others who have made significant contributions to this way of 
researching the problem, such as Strotz (1956) and Hirshleifer (1970), whose contri- 
butions will be discussed later. 
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with a constant interest rate 7 for all ¢. Evidently, as we noted above, C, 
= Cy whenr = n, so that when we find the rate of interest rp at which 
the household chooses a level consumption stream (i.e., zero savings 
from Y%), then 7 = 79, and the value of 7 has been revealed. 

Similarly, if r > 7, as it usually is, C; > Co. If one has independent 
information on how rapidly v‘(C) declines with increasing C (see 
Bailey et al. 1980), one can estimate 7 simply by examining the chosen 
growth path of consumption and finding the 7 that reconciles it with 
equation (8). 


V. Uncertainty 


Though there are some notable exceptions, such as Ramsey (1928), 
and more recently Stigler and Becker (1977), most economists appear 
to take it for granted that time preference is positive. Sometimes this 
conclusion seems to rest not so much on the erroneous inferences we 
described in Section III as on the notion that the individual is uncer- 
tain about the date of his own death and will therefore discount 
prospective utility because of the possibility he will die earlier than 
expected. This argument is quite unsatisfactory for at least two rea- 
sons. 

First, uncertainty about the date of one’s death can increase as well 
as decrease an individual’s provision for the future, because it can 
create an incentive to save more to provide undiminished consump- 
tion in the event of an unexpectedly long life. If income could 
costlessly be converted into an annuity, this would not be the case, but 
ideal annuities are not available, and in less developed economies they 
often cannot be purchased at all. It is an empirical question whether 
uncertainty about the length of one’s life increases or decreases provi- 
sion for the future. 

A second reason why uncertainty about the date of one’s death does 
not entail positive time preference is evident from the fact that there 
is normally positive saving in the society as a whole. Even if there were 
negative time preference over the life cycle, that is, a preference for 
utility in later as against earlier years of life, a society with no popula- 
tion growth or technological advance would have no social saving 
whatever if everyone consumed his entire output over the course of 
life: The savings in one set of years would just equal dissaving in 
another. In fact, the developed societies have saved enough to 
provide for a considerable long-term growth of tangible capital per 
workers as well as a vast increase in the amount of education and 
human capital. This substantial social saving had to be due in large 
part to a desire to bequeath increased consumption opportunities to 
descendants. The desire to bequeath assets to descendants provides 
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another reason why the argument that uncertainty about the date of 
one’s death implies positive time preference is inadequate. 

There is also uncertainty about whether the assets obtained 
through savings will generate the promised yield or even preserve 
their value. Such risks lower certainty-equivalent rates of return, and 
so are different and distinct from time preference. Risks may be 
reduced through diversification of portfolios, but transactions costs 
limit the extent to which risk can be eliminated in this way for the 
household of average means, and there are in addition great prob- 
lems for such a household in diversifying against the risks to property 
in any single country (or single set of countries that might have a 
common destiny). Thus the risk that investments will prove unre- 
warding would appear to reduce saving, just as a positive rate of pure 
time preference would. 

Uncertainty nevertheless also has two important countervailing ef- 
fects that have been neglected, if noticed at all, in the literature. The 
first of these arises from risk aversion and from the absence of futures 
markets and insurance markets for most goods. The absence of fu- 
tures markets makes it impossible for a typical household to hedge 
against a loss of demand for whatever specialized skill it has to offer, 
or against the loss in value of whatever other assets it possesses, or 
against a depression. Similarly, a typical household cannot insure 
against the birth of a child that will never be capable of supporting 
itself or even fully insure against a loss of the skill or health needed to 
generate income. These unhedgeable and uninsurable risks to the 
total income-earning capacity of a household, in combination with 
risk aversion, entail an increased incentive to save. In other words, 
even uncertainty about the security of assets can work to increase as 
well as to decrease savings, because in unfavorable contingencies 
assets contribute a large increment to prospective utility. This impli- 
cation is corroborated even by everyday language, as when people 
speak of “saving for a rainy day.” Second, the greater the amount 
saved, the smaller in general will be the ratio of the transactions costs 
of diversifying the porfolio or protecting it in other ways’ to the value 
of the portfolio. In summary, uncertainty may reduce the expected 
apparent rate of return to saving, yet at the same time in the absence 
of universal futures and insurance markets it can increase the “need” 
or incentive for saving, that is, its prospective utility. Because, in 
addition, uncertainty may favor accumulation of enough wealth to 


5 In the case of households whose assets are acquired by illegal or unpopular means, 
or which become so large they generate notoriety, additional saving may if anything 
increase the possibility of prosecution, extortion, or adverse political decisions. Yet 
concealment and diversification can reduce these risks also, perhaps more easily as 
wealth grows larger. 
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make diversification or other forms of portfolio protection practical, it 
need not lower even the apparent rate of return. 

The question of whether those uncertainties that imply the same 
behavior as positive time preference are quantitatively more 
significant than those that imply the same behavior as a negative time 
preference is an empirical question. Yet to the best of our knowledge 
it has not even been addressed. Nor are we aware of any persuasive 
empirical literature on whether other possible influences on saving 
behavior, such as irrationality or perceptual bias, combine with the 
two opposing effects of uncertainty to create an appearance of time 
preference that is on balance positive, or zero, or negative. 


VI. Is Time Preference Positive? 


The argument in prior sections helps us address this question now. 
We shall see in the next section that there are reasons for assuming 
that some households can have an indefinitely long or infinite time 
horizon, and that the conclusion we shall draw with this assumption 
holds a fortiori when there are shorter planning horizons. If n = 0, 
from equation (8) we have 


O(G) = ] 
0) (Co) oe ee ©) 


If we then extend the time horizon to later and later ¢ and apply (9) to 
the “last” period in the future, we obtain 


Pe oe 

lim UCD — 0, (10) 
This result implies that with the continuing positive interest rates that 
normally prevail and an 7 of zero, the household would lower Cy to 
the desperation level, where v'(Cy) = ~, if the Cy attainable under the 
budget constraint (7) is below the satiation level, at which v'(C7) = 0. 
Inasmuch as we normally do not observe either people who cut 
consumption down to utterly abject levels in order to provide for the 
future or people who have because of past saving reached consump- 
tion levels at which their taste for goods is satiated, there appears to be 
a positive rate of time preference, that is, in our formulation an ee Ue 
(We discuss later a possible escape from this conclusion if there is both 
an infinite time horizon and what we call “drastically diminishing 
marginal utility.”) 

We have shown earlier that r is normally greater than 7 because 
capital is useful in production, and the present argument suggests 
that if r exceeded 7» by a very large amount that would also generate 
implausibly low levels of present consumption. This suggests that, at 
least in those societies where real interest rates are very high and 
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expected to remain so, 7 must exceed zero by more than a miniscule 
amount.® 

Before turning to finite planning horizons, we should examine a 
familiar policy conclusion which assumes an infinite time horizon. 
This is the prescription of those who claim or imply that, at least when 
ecological systems or depletable resources are at issue, the only right 
and moral attitude is, in effect, a zero rate of time preference and an 
infinite time horizon—the interests even of distant generations 
should have the same weight as one’s own interest in current-year 
consumption. One need not have a what-has-posterity-ever-done- 
for-me attitude or a shortsighted view to oppose this supposedly 
moral command. It implies, as a glance at (10) will remind us, that at 
every positive interest rate the household should reduce its consump- 
tion to the level at which the marginal utility of current consumption 
is infinite, in order to give all to the future. Of course, that applies to 
every future year, and to every future generation, until either income 
rises so high that the marginal utility of consumption (when con- 
sumption equals income) has fallen to zero, or alternatively there is so 
much capital around that the interest rate is no longer positive. We 
doubt that the proponents of a zero time preference and an infinite 
time horizon have understood the implications of their argument and 
conclude that their argument surely cannot be a proper basis for 
national policy.’ 

The existence of positive time preference also has immediate impli- 
cations for the responsiveness of savings to changes in interest rates. 
We earlier defined 7 as zero for a household if, when it had equal 
endowments in each period and faced an interest rate of zero, it chose 
a stable level of consumption. A positive 7 1n these conditions, or any 
y greater than the interest rate, implies a decreasing level of con- 
sumption through borrowing or dissaving, whereas anr > 9 implies 
net saving. If n = 7 at any level, there is a stable level of consumption. 
Starting from sucha situation and looking at equation (8), we ask what 
would happen if the interest rate fell to zero. The analogy with the 
results in equations (9) and (10) immediately reminds us that this 
would give us 


tC) any 


rc ea iaiaeel ail i 
As T gets indefinitely long, this would give 


ae (Co) 


6 We are thankful to Peter Murrell for calling this point to our attention. 
7 For a related finding, see the Koopmans (1960) article discussed below. 
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We see that either Cy increases until v'(C)) = 0, or v'(C7) decreases 
until v'(C;) > ». That is, the ratio Cy/C, will become huge: The 
family will dissave on a large scale, unless it somehow suffers a per- 
manent plight so severe that v'(C,) goes rapidly boundless with a small 
reduction in C;, or so near satiation of the capacity to enjoy goods that 
v'(Co) > 0 quickly. This shows that it is logically contradictory to hold 
that the representative household is myopic or for any other reason 
has positive time preference, and also to hold that savings do not 
respond to changes in the interest rate and would normally remain 
positive at a zero interest rate.® (In fact, saving would be sensitive to 
the rate of interest even if time preference were negative and, since 
the marginal utility of consumption is normally diminishing, also if it 
were zero in any environment in which endowments vary over time.) 
We think this implication may surprise many readers. It will be clear 
from the next two sections that this conclusion, like the others, also 
holds true with finite time horizons. 


VII. Truncated Time Horizons 


Most significantly, the case that there is positive time preference— 
that is, something beyond Boéhm-Bawerk’s “first cause” —gets 
stronger as the planning horizon gets shorter. The reason for this is 
utterly obvious once stated: A finite planning horizon of T years 
implies that whatever happens after T years is not only discounted but 
given a value of zero. Were there not usually markets in which one 
can trade with those whose time horizons extend farther into the 
future, an individual would place no value at all on an asset, such as a 
bond, that could be redeemed or sold only after the end of his 
planning horizon. Truncating a decision maker’s time horizon can 
therefore only strengthen the evidence for a positive time preference, 
since it merely involves attributing a zero value, rather than a dis- 
counted but positive value, to any consumption after a certain date. 

The fact that truncated time horizons lead so directly and simply to 
positive time preference makes one ask how common and how long 
truncated time horizons might be. Are observed Savings rates, which 
as we have shown are far lower than a zero time preference and an 
infinite time horizon would imply, as low as they are simply because 
the behavior we actually observe is driven by time horizons on average 
about a lifetime or so in length? 

The notion that time horizons might average about a lifetime in 


* We have in mind a compensated saving schedule, along which household real 
Income is constant, so that the effect of a change in the interest rate is a pure 
substitution effect. 
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length has some appeal at first glance. It certainly must be an overes- 
timate for older people without heirs or other survivors about whom 
they are concerned. Probably it is about right for some others too, 
who for one reason or another may feel that the cares of one lifetime 
are enough. The number with concern for heirs is probably larger, 
but it appears that mainly this concern extends only a generation or 
two into the future. Even in dynastic families, such as that of the 
Capetian kings of France, we may wonder whether they thought 
ahead much beyond producing-a male heir and preserving his birth- 
right. Taking the different types of cases together, it might seem 
that truncated time horizons might average out to perhaps a lifetime, 
and that this is the main explanation of the positive 7 that we inferred 
from the small percentage of incomes that are saved. 

This interpretation is nonetheless unsatisfactory, as we realize from 
our earlier reference to the fact that lifetime horizons imply no saving 
by societies, unless the age distribution for a time contains dispropor- 
tionate numbers of people in the “savings” stages of life. There is 
substantial social saving, especially when we include education and 
human capital as we must, that is not explainable by growth, so the 
typical time horizon must be more than a generation in length.® 

This consideration in turn means that there is no special need to 
know the rate of time preference within planning horizons of a 
lifetime or less in length. Though individuals with time horizons as 
short as this presumably exist, the evidence adduced above suggests 
that they are exceptional. It is conceivable that these exceptional 
individuals could have a zero or negative time preference over the 
course of their planning horizons. If so, there is a remarkable discon- 
tinuity, for we know from the definition of a finite planning horizon 
that nothing that occurs after it is even worth taking into account. The 
dominant fact is that the time preference for consumption within the 
planning horizon is infinite compared to anything that happens after 
it. Whether one is interpreting aggregate time series on saving and 
accumulation or analyzing social policy about them, this infinite time 
preference is the dominant fact, besides which the presence or ab- 
sence of time preference within the life span pales to insignificance. 

It is indeed fortunate that time preference within planning hori- 
zons of a lifetime or less in length is not the key issue, for (contrary to 
what we once believed) it is almost impossible to find out what time 
preference over the life cycle might be. Family size and optimal 


® Growth due to technological advance in what would otherwise be a steady state also 
implies some net social saving, because young savers have higher lifetime incomes than 
have older people, so that their total saving exceeds the dissaving of retired people. 
However, White (1978) and Kotlikoff and Summers (in press) find that total saving far 
exceeds the amount that can be attributed to life-cycle saving in a growing economy. 
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periods for major investments in human capital change over the 
course of life, and though it is easy to incorporate these considerations 
into our theoretical framework, the changes in spending and savings 
levels they bring about are difficult to disentangle from any short-run 
time preference. Even the individual’s capacity to enjoy consumption 
may vary with age, as is suggested by the cliché that one should not 
postpone high levels of consumption until one is “too old to enjoy it.” 
In most households the wage rate also changes substantially with age, 
and this again affects the level of savings in each period. There are 
even problems arising because of changes in the value of time due to 
these changes in wage rates. The final outputs desired by consumers 
are characteristically produced through household production func- 
tions in which time and market goods are inputs. As Becker and Ghez 
(1975) have shown, changes in wages over time can affect savings 
levels in at least two significant ways. First, the higher the wage rate 
the greater the incentive to substitute purchased goods for time in the 
domestic production of final outputs, as may occur when a family 
increases its reliance on caterers, convenience foods, and restaurants 
when wage rates of family members are unusually high. Second, the 
anticipation of a different opportunity cost of time in the future can 
change the time profile of final consumption and savings, so that 
there is more consumption in periods when a lower cost of time makes 
the price of final output cheaper, as could occur when a family 
postpones the spending involved in distant tourism till retirement. 
Given the unknown and sometimes contrary effects of these and 
other influences, it is not immediately clear what levels of savings 
would support or refute the hypothesis that time preference is posi- 
tive within time periods of a lifetime or less. 


VIII. Bequests 


Let us now return to the more representative case of a household with 
heirs for which it has some concern. What a couple during the course 
of life gives to its children and what it bequeaths to them has a capital 
value equal to the discounted future value of the consumption (and 
later bequests) it is expected to provide. In the minds of the givers this 
bequest must have a prospective value or utility at least equal to the 
utility forgone by not consuming it themselves. The extent of any 
willingness of the household to reduce the heads’ current consump- 
tion to provide for their children’s endowments is the central param- 
eter revealing the presence or absence of the “myopia” discussed in 
debates about the determinants of saving. 
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A household with the faith that it will have a continuing line of 
descendants with the same regard for their children as it has for its 
own will act as if it had an indefinitely large or infinite time horizon, 
even if it gives such a long time horizon no explicit attention. Suppose 
that the household spends that amount on education, gifts, and be- 
quests that strikes its preferred balance between its own desire to 
consume and its provision for the children’s future, and leaves it to 
the children to allocate their resources in turn. When the problem is 
understood in this way it becomes clear that there is an extensive 
theoretical literature that is directly applicable, notably that of 
dynamic programming. One of the basic findings in this literature is 
that the optimal program for several periods is the same as that 
reached by optimizing each period in turn, including setting optimal 
end-of-period conditions each time for the beginning of the next 
period (see, e.g., Bellman and Kalaba 1965, chap. 2); the result is the 
same whichever way one looks at the problem. That this notion 
applies precisely to intergenerational resource allocation is shown by 
Barro (1974). It can accordingly be appropriate to view the choice of 
the size of the bequest as if the household were maximizing a utility 
function of the form (1) with an indefinitely large, or infinite, time 
horizon. The T represents not how far into the future people plan 
consumption, but the length of a dynamic program implied by a 
continuing series of intergenerational transfers. 

Let us now suppose that the conditions that make it appropriate to 
assume an infinite time horizon do not hold, so that finite time 
horizons such as we used to develop equations (1)—(8) are appropri- 
ate. If the present heads of household are utterly unconcerned about 
any descendants beyond, say, their grandchildren, this lack of con- 
cern will manifest itself in the form of a smaller bequest than other- 
wise. Uncertainty about whether a subsequent generation will have 
offspring, or fail to make a bequest to its children, will also make for a 
smaller bequest. The effect of a lack of concern for more distant 
generations, or of uncertainty about whether the bequest will in fact 
serve the givers’ purposes, is approximately the same as if the house- 
hold applies a high rate of time preference 7 in maximizing utility (1) 
with a large or infinite T. We can see this by taking equation (1) and 
making v(C,) a declining function of time, for example, v(C,t) with v; < 
0. This is in effect equivalent to keeping (1) as it is and raising the 
value of 7. This makes it clear in another way that less-than-infinite 
time horizons entail positive time preference. Thus the decision to let 
T in equation (1) be finite at some points, and to let it approach 
infinity in Section VI, was governed largely by analytical convenience 
and had no substantive effect on the conclusions. 
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IX. Drastically Diminishing Marginal Utility 


With infinite time horizons and continued positive interest rates the 
returns to savings ultimately become so colossal that, if there were not 
positive time preference, current consumption would be drawn down 
to survival levels. There are, however, two phenomena in addition to 
positive time preference that conceivably could explain the high levels 
of consumption that actually occur. One is that such large increases in 
endowment income are expected that saving for still more future 
consumption is not worthwhile however much interest would ac- 
crue.!° The other possibility is that the marginal utility from con- 
sumption in any period diminishes so drastically as consumption rises 
that even essentially unlimited interest income would not yield 
enough future utility to justify much saving. There is no reason for 
most households to assume increases in endowments so great that 
they could justify the low observed savings rates. It is, on the other 
hand, just possible that drastically diminishing marginal utility in 
combination with plausible expectations about increasing endowment 
incomes, perhaps due to technological advance, could explain the 
observed levels of saving. 

Suppose, for example, that the household expects an exponentially 
rising real income due to technological advance, even if its saving and 
aggregate savings are zero. Equation (8) implies a chosen rate of 
growth of consumption that is independent of the horizon T; this 
growth will be exponential if marginal utility is a constant-elasticity 
function, that is, if utility has the form 


UC) AGB (13) 
fora < 0,A < 0, and B > 0, or, for the unit elastic case, the form 
v(C) =A 1nC+B (14) 


with A > 0 (see White 1978). For Harrod-neutral technological ad- 
vance, there is some fraction of income saved that is consistent with 
having consumption grow exponentially at the same rate p as income 
does. If so we can have v'(C7) > 0 as T > & because of this growth. 
For example, suppose that utility is given by the Bernoulli function 
(14). Then the optimum consumption pattern implied by (8) is 


Uke, 
ee aay 


for every ¢. If it should happen that 1 + p = (1 + 7)/(1 + NOL; 
approximately, 


Co (15) 


'° We are grateful to Christopher Clague for calling our attention to this possibility. 
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PST 7; (16) 


consumption will grow at the same rate as income and will remain a 
constant fraction of income, as is actually observed. This pattern can 
be sustained indefinitely and can plausibly be reconciled with a con- 
stant rate of interest (see Solow 1970, chap. 5). 

If the rate of time preference were zero, equation (15) would imply 
that consumption rises indefinitely at the rate 7. When 7 > p, as has 
been true of the United States, this high rate of growth of consump- 
tion can be maintained for large T only by pushing Cy down toward 
zero, pushing the fraction of income saved toward unity. 

This set of results and considerations makes a compelling case for 
the existence of a positive rate of time preference, unless one sup- 
poses that the utility function v(C) is inelastic and that unlimited 
exponential growth of income is expected even with zero saving. In 
the past, output per unit of input has grown at about | percent a year, 
and the U.S. birthrate has converged on zero population growth. If 
the real rate of interest of about 5 percent (which has prevailed 
during most of the past several decades) is what households expected 
to continue into the indefinite future, these figures can be shown to 
reconcile with zero time preference only if the elasticity of the utility 
function (13) is minus 5 corresponding to a = —4. This function has 
what we shall label “drastically diminishing marginal utility.” We do 
not argue that utility functions of this character are the norm but 
know of no data that rule them out and think it is not without interest 
that (when exogenous increases in endowments are expected and 
time horizons are also infinite) the facts of saving force us to choose 
between positive time preference and drastically diminishing margi- 
nal utility of income. 


X. The Relationship to Prior Findings 


Though many of the prior findings on time preference contradict one 
another, and there has been no generally accepted analysis of the 
matter, many of these writings nonetheless have considerable value. 
With the aid of the findings here we are in a better position to 
understand or appreciate some prior contributions. 

One of the best-known contributions is that of Strotz (1956). The 
extreme simplicity of Strotz’s insight and its relationship to our argu- 
ment can be made immediately apparent. Suppose that an individual 
has a positive time preference, in that he values consumption in the 
future less highly than consumption now, but it is only the distance 
into the future, rather than the specific date, that determines the 
weight given to the utility. Suppose for the sake of a simple example 
that the marginal utility of consumption in any one period is the 
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same at all levels of consumption, however high or low, so that we can 
ignore the diminishing marginal utility of consumption and focus 
exclusively on Strotz’s insight. Let us suppose that our individual must 
obtain all consumption out of an initial endowment of 25 and that his 
time preferences are such that he chooses initially to consume 20 
units, or four-fifths of his stock, in the present period, three units in 
the second period, and two units in the third. When the first period is 
over, however, the individual will abandon his initial plan and con- 
sume four-fifths of his remaining endowment of five, or four, since by 
stipulation he weighs the utility in the present so highly he consumes 
four-fifths of his stock, and period 2 is now the present. This is of 
course inconsistent with the initial plan for consumption in the second 
and third periods; the initial plan has not been followed, even though 
all of the individual’s expectations about his situations and tastes have 
proved correct. The point is that no individual with the pattern of 
time preference described will voluntarily adhere to the plans he 
makes in advance; it is impossible for him to maintain his pattern of 
time preference and also adhere to the intertemporal plans he makes 
at the beginning. It becomes obvious on reflection that the only 
patterns of positive time preference that individuals can have and still 
make consistent or rational intertemporal choices are those which 
involve a constant exponential rate of discounting: a constant » such 
as we have all along assumed. If an individual plans to save the same 
percentage of his period 2 stock as of his period 1 stock, he will in fact 
carry through with that plan if his tastes do not change. Of course, no 
positive time preference at all (y = 0) is consistent with rational 
behavior; it would have led our hypothetical individual to consume a 
third of his stock in each of the periods and to have adhered to that 
plan. Thus we see that it is not consistent to assume both rational 
behavior and positive time preference without also assuming that 7 1s 
an exponent of constant value. 

Unfortunately, Strotz goes beyond the insight just described to 
define “prudent” or “thrifty” behavior as any behavior which meets 
the standard of intertemporal consistency: An individual is arbitrarily 
defined to be “thrifty” if he discounts the future at a constant expo- 
nential rate, however high that rate may be. In keeping with his 
preoccupation with perceptual error, inconsistent choices, and the 
psychology of intertemporal choices, Strotz concludes that “consumer 
sovereignty has no meaning in the context of the dynamic decision- 
making problems” (1956, p. 179). 

Though Strotz’s disturbing conclusion is consistent with the rest of 
his argument, we can now readily see that it is not in fact justified. Ifa 
positive time preference, which he calls “myopia,” were solely the 
result of irrationality and perceptual errors, then consumer 
sovereignty would indeed have little or no meaning for intertemporal 
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choice. But as we have seen, the absence of a positive time preference 
implies extreme behavior, or else drastically diminishing marginal 
utility. Short planning horizons can occur and need not imply any 
irrationality, and the very act of ignoring any return beyond some 
limited planning horizon as we know entails a positive time prefer- 
ence. Thus, though the intransitive and irrationally impatient behay- 
ior Strotz illuminated can of course also occur, his nihilistic general 
conclusion about the inapplicability of consumer sovereignty to in- 
tertemporal choices is incorrect. 

Our analysis is in closer accord with writings of Koopmans, Dia- 
mond, and Williamson (1964) and Hirshleifer (1970). These writers 
also conclude that there is positive time preference, which they define 
somewhat differently. They do not, however, offer any “empirical 
test” of the sort we used, nor go into shorter time horizons and 
drastically diminishing marginal utility. Koopmans et al. (1964) prove 
that various sets of formal postulates entail the existence of positive 
time preference. The formal postulates are naturally chosen in part 
because of their tractability, and it is not always immediately clear 
whether these postulates are very restrictive or succeed in capturing 
the essence of the matter. It is not immediately clear, for example, 
whether Diamond’s (1965) assumption that each period’s utility v(-) is 
bounded is limiting, or even material to the proof, or whether it 
matters that these authors exclude the possibility that v’(Cy) = ~ or 
that v'(C,) = 0, which we allow. The fact that more work needs to be 
done before the generality and applicability of the Koopmans (1960) 
and Diamond (1965) proofs can be definitely established is perhaps 
best demonstrated by quoting Koopmans’s briefest summary of the 
reason for positive time preference in his article: “. . . if there is in all 
circumstances a preference for postponing satisfaction—or even 
neutrality toward timing—then there is not enough room in the set of 
real numbers to accommodate and label numerically all the different 
satisfaction levels that may occur in relation to consumption levels for 
an infinite future” (1960, p. 288). Hirshleifer (1970, p. 96), using a 
line of reasoning much closer to ours, concludes in an important if 
cryptic passage that zero time preference would imply infinite defer- 
ral of all consumption. That is closer to the mark but overlooks some 
of the possibilities. In general, if the argument in our paper Is correct, 
it suggests that the contributions we have just cited deserve more 
attention and perhaps further extension as well. 

More recently, Stigler and Becker (1977) have put forth conclusions 
that directly contradict those of Strotz (1956), Koopmans et al. (1964), 
and Hirshleifer (1970). Stigler and Becker argue that many types of 
behavior that economists have traditionally characterized as special 
patterns of taste are in fact explicable in terms of different prices and 
incomes. One of the ad hoc assumptions about tastes to which Stigler 
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and Becker especially object is the assumption, going back as we know 
to Bohm-Bawerk, that people “systematically undervalue . . . future 
wants.’ The taste for consumption in say 1984 is alleged to continue to 
shift upward as 1984 gets closer to the present. In spite of the impor- 
tance frequently attached to time preference, we do not know of any 
significant behavior that has been illuminated by this assumption. 
Indeed, given additional space, we would argue that the assumption 
of time preference impedes the explanation of life cycle variations in 
the allocation of resources, the secular growth in real incomes, and 
other phenomena” (Stigler and Becker LOA12 p:789): 

It might seem that the quoted passage by Stigler and Becker con- 
tradicts our own findings no less than those of the authors we have 
cited, since we also claim to have demonstrated the existence of 
positive time preference. This is not in fact the case; one could 
consistently accept Stigler and Becker’s conclusion and our own as 
well. We have defined positive time preference in a slightly broader 
and more readily testable fashion than Stigler and Becker have, and 
we neither assumed nor ruled out irrationality or perceptual error. 
Accordingly, it is, as we argued, an empirical question whether there 
is time preference, and the revealed preference approach made it 
possible to assess the evidence. It would of course be possible to 
discuss the question of why bequests are as small as they are, or why 
some people have only finite planning horizons, with different ter- 
minology, though excluding the phrase “time preference” and 
synonyms for it would in our judgment involve circumlocution.” 


XI. Conclusions 


Our principal findings are the following: 

1. A finite time horizon implies that only consumption inside the 
horizon has value and signifies an absolute unwillingness to transfer 
goods to children or other descendants who could benefit from them 
after a specified future date. Thus a finite planning horizon is both 
theoretically and practically equivalent to a positive rate of pure time 
preference. Planning horizons that do not reach beyond the lifetimes 
of those who make the plans can exist, and (though this implies a 
remarkable discontinuity) it is logically possible that within such 
lifetime horizons time preference could be zero or even negative. But 
this possibility can have little significance in comparison with the 
high rate of time preference implied by the short horizon, and in any 
event it does not seem feasible to disentangle time preference during 


' Stigler and Becker’s footnote assertion that “a consistent application of the as- 
sumption of stable preferences implies . . . the absence of time preference” (1977, p. 78, 
n. 4) is either mistaken or dependent upon definitions of “stable preferences” or “time 
preference” that appear to be untypical. 
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the course of a lifetime from other changes over the life cycle. Time 
horizons which imply no bequests for descendants also cannot be the 
norm, since they would imply lower savings than are observed and no 
social saving whatever in a society in a stationary state. We have also 
shown that finite time horizons and positive time preference are 
functionally equivalent, so the most interesting question is whether 
there is positive time preference within an infinite time horizon. If 
there is, there is no need to go into the length of planning horizons, 
and the analytically convenient assumption of an infinite time horizon 
can be used without significant loss of generality. 

2. An infinite time horizon and a zero rate of time preference 
suggest that (if the foreseeable rate of economic growth is less than 
the expected long-term rate of interest) current consumption would 
be no higher than the subsistence level. Over an indefinitely long time 
span continued positive interest rates imply such colossal gains in 
future consumption from reductions in current consumption that all 
income beyond that needed for survival would be saved. 

3. There is, however, the alternative possibility that even vastly 
higher consumption is expected to bring so little increase in utility 
that households are unwilling to forgo consumption now, even 
though this can generate essentially limitless increases in consumption 
later. The possibility that there will be absolute satiation at these 
future consumption levels can be rejected on the classical grounds 
that wants are limitless, but there remains the intriguing possibility 
that there is what we here labeled “drastically diminishing marginal 
utility”: Even boundless increases in future consumption would then 
bring such modest increases in utility that the low levels of saving we 
observe could be rational even with a zero rate of time preference. 
The observed levels of saving imply positive time preference, or 
drastically diminishing marginal utility, or both of these interesting 
phenomena. In an age that has seen secularly rising incomes, it might 
seem that drastically diminishing marginal utility is a type of positive 
time preference, since an unwillingness to bequeath much to descen- 
dants, on the grounds that they will be better off than we are and will 
not get much satisfaction from still higher consumption, provides a 
good rationale for preferring current consumption and leads to the 
same observed savings levels. This is not in fact correct. If there were 
a sufficient technological regression or other losses in endowments, 
yet a positive interest rate, zero time preference would imply that 
current consumption would be reduced to subsistence levels, but 
drastically diminishing marginal utility would not. Similarly, if for 
shifts of goods within their own lifetimes people behave as if they had 
Bernoulli utility functions, intertemporal consistency would require 
the use of positive time preference to reconcile this behavior with 
their gifts and bequests. Also, whether drastically diminishing utility is 
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present or not, finite planning horizons entail positive time prefer- 
ence. Drastically diminishing marginal utility also may be needed to 
explain other phenomena, such as a willingness to pay a high price to 
insure against even modest losses or to reduce fluctuations in con- 
sumption levels, and savings levels in particular cases can be traced to 
drastically diminishing marginal utility according as its other implica- 
tions are present or absent. 

4. Uncertainty about the future has double-edged effects. Of 
course, if the typical household thinks that there is a greater 
likelihood that part of its bequest will merely be destroyed in a 
future catastrophe than that the full bequest adds to the probability 
that descendants will survive well in that catastrophe, the expected 
rate of return to saving is thereby lowered in their minds. That 
expectation, instead of time preference, could in principle account for 
high consumption and low saving. However, each type of uncertainty 
we can think of offers possible reasons for saving more as well as 
reasons for saving less. As in the other cases, it is hard to see a simple 
way to distinguish it empirically from pure time preference. Thus for 
purposes of this inquiry we have made the simplest assumption: that 
uncertainty about whether any marginal savings will be lost is on 
average about offset by the incentives for additional saving that un- 
certainty about the future creates. To make a better specification 
possible in any further studies, we have made clear above that the 
prevalence of futures and insurance markets, the degree of risk 
aversion, the degree of confidence in predictions about the future, 
and the costs of diversification or protection of portfolios each has 
distinct effects on savings rates, and these might eventually be esti- 
mated. 

In summary, the case for positive time preference is absolutely 
compelling, except in the interesting case where there is drastically 
diminishing marginal utility. Utility functions of this sort have in- 
teresting implications of their own which need to be tested separately. 
Time preference is accordingly by no means a subject that deserves 
only casual references to alleged perceptual errors or erroneous in- 
ferences from the existence of positive interest rates. It is a phenome- 
non of great practical and theoretical importance that is fortunately 
quite amenable to analysis. 
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Both economic theory and casual empirical observation of the U.S. 
economy suggest that spending propensities from temporary tax 
changes are smaller than those from permanent ones, but neither 
provides much guidance about the magnitude of this difference. 
This paper offers new empirical estimates of this difference and 
finds it to be quite substantial. The analysis is based on an amend- 
ment of the standard distributed lag version of the permanent in- 
come hypothesis that distinguishes temporary taxes from other in- 
come on the grounds that the former are “more transitory.” This 
amendment, which is broadly consistent with rational expectations, 
leads to a nonlinear consumption function. Though the standard 
error is unavoidably large, the point estimate suggests that a tempo- 
rary tax change is treated as a 50-50 blend of a normal income tax 
change and a pure windfall. Over a 1-year planning horizon, a 
temporary tax change is estimated to have only a little more than half 
the impact of a permanent tax change of equal magnitude, and a 
rebate is estimated to have only about 38 percent of the impact. 


In 1968, faced with a classic case of demand inflation, Congress 
enacted a temporary increase in personal income tax payments to 
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curb aggregate demand. In 1975, near the trough of our worst post- 
war recession, Congress enacted a tax rebate and other temporary 
decreases in taxes and increases in transfer payments designed to 
stimulate aggregate demand. Questions have been raised about the 
effectiveness of both measures. 

The questions have both theoretical and empirical roots. On 
theoretical grounds, the permanent income-life cycle hypothesis 
seems to argue that temporary income tax changes should have little 
effect on consumer spending in principle. On empirical grounds, data 
on consumer behavior seem to suggest that the impacts of the two 
temporary taxes on spending were also small in practice. In 1968, the 
savings rate fell from 7.5 percent in the quarter immediately preced- 
ing the tax surcharge (1968:2) to only 5.6 percent in the first quarter 
of the surtax, suggesting that consumers kept spending despite the 
tax. In 1975, the savings rate ballooned from 6.4 percent just prior to 
the rebate to a stunning 9.7 percent in the quarter of the rebate 
(1975:2), suggesting that little of the rebate was spent. 

The purpose of this paper is to study these two temporary tax 
changes in some detail. Precisely what prediction does economic 
theory make about the relative effectiveness of temporary versus 
permanent tax changes? And what conclusions can be reached from 
U.S. time-series data? The fact that the Carter administration asked 
for (but did not get) a repeat performance of the rebate in 1977 
suggests that there is more than academic interest in the answers to 
these questions. 

Section I outlines the theoretical issues, beginning with an idealized 
life-cycle model and proceeding to introduce some important “real 
world” considerations. Since the discussion shows quite clearly that 
the issue is an empirical one, Section II reviews previous empirical 
work on the subject very briefly. Section III explains the underlying 
basis of the empirical model of this paper, relating it to recent litera- 
ture on rational expectations and the permanent income hypothesis 
(PIH), and then Sections IV and V show how this basic conceptual 
framework was converted into an operational empirical model. The 
estimates are presented and analyzed in Section VI, and Section VII 
summarizes the main conclusions. 


I. The Implications of Theory: Pure and Impure 
1. The Pure Permanent Income—Life Cycle Theory 


As Eisner (1969) pointed out some time ago, the PIH casts doubt on 
the effectiveness of income tax changes that are labeled as temporary 
because such measures have only minor effects on permanent income. 
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To develop a theoretical benchmark for the marginal propen- 
sity to consume (MPC) that the PIH suggests might apply to a tempo- 
rary tax, consider a rarefied world in which consumers with exoge- 
nous earnings streams select consumption paths to maximize lifetime 
utility. If capital markets are perfect, only the discounted present 
values of the earnings streams matter, so suppose all households earn 
a constant income y per year. Assume further that households differ 
only in age, a; that the real rate of interest is zero; and that the 
subjective rate of time discounting is also zero.' The question is, If 
income taxes are raised by z per capita for the period from¢ = 0 tot = 
t,, how much less will consumers spend over this interval? 

In answering this question, there are three population groups to 
keep track of. People who are “alive” (in the economic sense) at ¢ = 0 
and who live past the expiration of the tax suffer an income loss of ¢,z 
over the period. If T denotes the length of life, then each such person 
of age a consumes a fraction t,/(T — a) of this loss during the years in 
which the temporary tax is in effect. Thus the change in consumption 
per capita is AC, = zt#/(T — a). 

Old people who are alive at ¢ = 0, but die before ¢ = ¢,, lose only (T 
— a)z in income. However, since they have MPCs of unity during the 
surtax period as a whole, their change in consumption per capita is 
AGs = @ — a)z. 

Finally, we must worry about people who are born between t = 0 
and ¢ = ¢,. Ifa, a negative number between 0 and —1¢,, denotes the age 
of such a person, and he lives for ¢; + a <¢, years during the period 0 
<¢t <¢,, his income loss is (¢; + a)z. Since he spends a fraction (t, + 
a)/T of this income during the period 0 < t < ¢,, the change in his 
consumption is AC; = [z(t, + a)?]/T. 

To derive the aggregate change in per capita consumption, weight 
these groups by the age distribution, considering the ages —t; <a < 
T. In the simplest case of a uniform age distribution, f(a) = 1/T, the 
change is 


kG ANE = ONG 
NG =| ae J Z oda 
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Working out the integrals and dividing by the total income that is 
taxed away during the period (¢,z per capita), we obtain 





MPC = (log T ~ log 4) + [1 - a +e fi 


2t, 21h 5 


where the three terms show the contributions of the three different 
population groups. 


‘This is essentially the model introduced by Modigliani and Brumberg (1954). 
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To take a concrete example, suppose the typical lifetime of a 
household head as a household head is T = 50 years. Then, according 
to this formula, the MPC for a 1-year tax (t, = 1) is .09, while that fora 
2-year tax (t,; = 2) is .15.2, The MPC = .09 fora 1-year temporary tax is 
only a rough benchmark representing the pure PIH, and there are a 
number of reasons why the theory probably systematically under- 
states the responses of consumers to temporary taxes (see below), so 
we should not take this number too seriously. Still, there are two 
lessons worth drawing from this simple exercise—lessons that have 
often been forgotten in the temporary-tax debate. 

a) Income gains and losses from temporary taxes will eventually be 
spent just like any other increment or decrement to lifetime re- 
sources: if less is spent at first (because t, < T), then more will be spent 
later. Thus if we want to inquire about the “effectiveness” of tempo- 
rary taxes, we must specify a time horizon. Over a long enough run, 
they must be just as “effective” as permanent ones. 

b) The so-called zero effect view—that consumers ignore the surtax 
and consume as if it never happened—does not represent the PIH at 
all. Instead, that theory says that consumers should spend precisely 
what they would on receipt of a windfall gain (or loss) of ¢,z. In the 
illustrative calculation, this turns out to be the “9 percent effect” view. 


2. Caveats and Imperfections 


There are several reasons why surtaxes may affect spending more 
strongly than indicated by pure theory. First, tax-induced income 
changes that are not consumed must be saved. If windfall gains are 
used to purchase durable goods, consumer spending may rise much 
more strongly than consumption; the converse may happen when 
there are windfall losses. The magnitude of the marginal propensity 
to spend windfalls on durable goods is, of course, an empirical ques- 
tion.’ 

Second, some households may be subject to liquidity constraints 
that are usually ignored by the PIH. If we stay within the certainty 
context, these constrained households will react strongly to even tem- 
porary income changes.4 Thus the aggregate MPC for a temporary 
tax is a weighted average of the low MPCs of unconstrained house- 
holds and the high MPCs of constrained ones. Again, the importance 
of this phenomenon is an empirical question. 


2 By way of comparison, as t; > T, the MPC’ 5/6. 

3 See, e.g., Darby (1972). : 

4 See Blinder (1976) and Dolde (1978). For a look at one particular type of uncer- 
tainty, see Foley and Hellwig ( 1975), which shows that this result may not carry through 
to the uncertainty case. 
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Third, as Okun (1971) pointed out, consumer behavior depends on 
what people believe rather than on what the government announces. 
If consumers disbelieve the government when it tells them that a tax 
hike is only temporary, then the spending response will be greater 
than that suggested by a naive application of the PIH.> Since the 
perceived duration of the surtax, not the declared duration, is rele- 
vant from the standpoint of the PIH, this too raises an empirical issue. 

Finally, we must recognize the possibility that households may not 
do the kind of rational long-term planning envisioned by Modigliani 
and Brumberg (1954) and Friedman (1957) or, what amounts to the 
same thing, have very high subjective discount rates. If they are very 
shortsighted, then temporary fluctuations in disposable income may 
have substantial effects on spending. 


II. Previous Empirical Work 


Okun’s (1971) study opened the empirical debate on this issue. Using 
the consumption equations of four econometric models, he compared 
the “full effect” view that the 1968 surtax was just as effective as a 
permanent tax increase to the “zero effect” view that consumers totally 
ignored the surtax. While he concluded that the full effect view fit the 
data better, an intermediate “50 percent effect” view actually does 
better than either extreme.® Springer (1975) criticized Okun’s 
econometric procedures and then performed a similar experiment 
with a consumption function based on the PIH. He concluded that 
the zero effect view performed better. 

Juster (1977), using a series of savings equations based on the 
Houthakker-Taylor (1966) model, reached conclusions about the 
1975 rebate similar to those of Okun for the 1968 surtax. But Modig- 
liani and Steindel (1977) found that the rebate had very little impact 
over a horizon of 1 or 2 quarters. Their modified version of the 
life-cycle model implied, however, a virtually full effect view over a 
6-quarter horizon. Modigliani and Steindel assumed that the nonre- 
bate portions of the 1975 tax cuts were treated like permanent taxes 
and handled the 1968 surtax with dummy variables. 

The existing empirical literature thus offers little consensus. The 
issue seems quite open. 


III. The Distributed Lag Model of Consumption 


The basic vehicle for investigating the effectiveness of temporary 
income taxes in this paper is the distributed lag version of the PIH. 


* This point has rather less cogency with respect to tax cuts; but here too consumers 
may believe them to be more permanent than the government announces. 
® On this, see Blinder and Solow (1974, pp. 107-9). 
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While this is the standard way of implementing the PIH empirically, 
the recent literature on rational expectations has seemed to raise 
doubts about its validity.’ This section shows how the PIH and 
rational expectations together lead to an estimating equation very 
much like the one I use in this paper. 

As Muth (1960) pointed out, the PIH is basically a forward-looking 
model of consumer behavior. It states that consumers, in deciding on 
their current spending, weigh their current asset holdings, their cur- 
rent income from labor, and their expected future income from 
labor. Specifically, permanent income at time ¢ is defined as 


= v7 
yP =A ae oS tae bains! 1 
pee Ut ry? (1) 


where 4, is the stock of real assets at the beginning of period ¢, Y; is 
noninterest income in period ¢t, and ;Y;4, is the mathematical (.e., 
rational) expectation of Y;,, that is formed at time ¢. (By convention, 
‘Y; = Y;.) However, if the stochastic process generating income can be 
described by a time-series model such as 


ig Net oN go ec ee Gy h eet an Sts (2) 


where €, is a white noise error term, then the resulting empirical 
model of consumption will be backward looking. For example, if the 
theoretical consumption function is 


CoO a kV Uy, (3) 
then the empirical consumption function will be 
(oe = 6 ce kA, a bo: a Day a + aoe y a De ee ca Ut. (4) 


To see this, it is only necessary to note that the (rational) expecta- 
tion of income in period t + s can, in view of (2), be based only on the 


information set {Y;,, Yi, Yr-2,.-- }. Thus, for example, 
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and so on. Substituting all such expressions into the definition (1) and 
then into the consumption function (3), it is clear that (4) is derived. 
As pointed out by Sargent (1978) and others, the coefficients }; in 
equation (4) will be complicated functions of the coefficients a; in (2). 

Suppose then that, as suggested by Dolde (1976), we can distinguish 
among two or more sources of income whose generating functions (2) 
may differ. The PIH in conjunction with rational expectations then 


7 See Lucas (1976) and esp. Hall (1978). 
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implies that the b’s in (4) should follow a different pattern for each 
income source. A simple example will illustrate this point and also 
give us some feeling for possible magnitudes. Consider several in- 
come sources, each of which is generated by a first-order autoregres- 
sive: 


Vin Pi) aan ae Ga (5) 


Working out the expectations and plugging into (1) gives us a simple 

expression for the permanent income attributable to each source: 
Vee Ey eee ite ine (6) 

ap; 

Notice that, despite the long time horizon contemplated by the PIH, 

consumption depends only on current income. In general, consump- 

tion will depend on past income only up to lag n, where n + | is the 

longest lag considered in equation (2). 

Now compare two income sources, one of which is entirely perma- 
nent (p = 1) and the other of which is entirely transitory (9 = 0). A 
$1.00 increase in the permanent component will, according to (6), 
raise permanent income by $(1 + r)/r and thus raise consumption by 
$k(1 + r)/r. This may imply a very large immediate spending re- 
sponse.’ By contrast, a $1.00 fluctuation in the purely transitory 
component will, again according to (6), raise permanent income by 
only $1.00 and thus raise consumption by only $k. The lesson, of 
course, generalizes and applies far beyond the confines of first-order 
autoregressives: income sources deemed to be more permanent will elicit 
prompter spending responses than income sources deemed to be more temporary. 
The application of this principle to permanent versus temporary 
changes in taxes is apparent and immediate and was elucidated clearly 
by Lucas (1976). It is the basic notion underlying the empirical model 
to be developed in the next section. 

However, lest confusion arise, I should stress that there is no sense 
in which the rationality of expectations is either assumed or imposed 
in the consumption functions estimated here. My point is only that the 
distributed lag formulation of the PIH is consistent with rational 
expectations. As Sargent (1978) has emphasized, rational expectations 
delivers a set of restrictions across equations (2) and (4) that can be 
imposed in estimating the two jointly. I have made no attempt to 
impose these restrictions here because my interest was in getting the 
best possible consumption-function estimates, not in testing 
rationality. Furthermore, it is well known that quite different models 


* Suppose the rate of subjective time discounting is equal to the rate of interest, so 
that a constant consumption stream is optimal, and that B is the lifetime propensity to 
consume (i.e., 1 — B is the propensity to bequeath). Then k will be Br/(1 +r), so that k(1 
+ r)/r will be B, which is close to unity. 
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of consumption behavior (e.g., habit persistence) can lead to an es- 
timating equation very much like (4). It is not my purpose to discrimi- 
nate among alternative ways of arriving at (4). 


IV. Derivation of an Estimating Equation 


The preceding discussion makes it clear that different distributed lag 
coefficients might be associated with different sources of income. 
While the actual empirical analysis considered four types of income, 
the model is most readily explained if I suppose there are only two: 
income (positive or negative) attributable to temporary tax measures, 
which I denote as S, (“special income”); and all other dis posable 
income (“regular income”), which I denote as R;. The R, should not be 
confused with permanent income, since it has both permanent and 
transitory components. The basic idea underlying the estimating 
equation is that S; is identifiably “less permanent” than R,. 

Suppose consumption responds to R; according to a set of distrib- 
uted lag weights: w; = 0C,/OR;_;,7 = 0, 1,..., n. Since the w; depend 
on the stochastic process generating R,, it is worth reporting that the 
deviations of R, from a logarithmic time trend are well described by 
the following second-order autoregressive:® 


‘et = 1.28y,_4 — Riley ey fae = 91, D-W = 1.86. 
(09) — (.09) 


When income follows a second-order autoregressive, permanent in- 
come as defined in (1) is: 


(+r? 
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it dee oss (l a 7) 1 + 7, where K; is the present value of the trend 
component of labor income and y, and y,;_; are current and lagged 
deviations from trend. Given the estimates of a, and a, above, and for 
r = .0074 (a 3 percent annual real interest rate), the implied 
coefficients are Y? = A, + K; + 13.5y, — 4.7441. This leads us to expect 
a very large value of wo, followed by swiftly declining w’s—possibly 
even turning negative. The empirical results bear this out. 

As Lucas (1976) has argued, income changes that are clearly “more 
temporary” than regular income should get different spending 

9 Standard errors are in parentheses. Longer autoregressives, however, give slightly 
better fits. E.g., ift — 11 is the longest lag allowed to enter the regression, significant 
coefficients are obtained at lags 1, 2, 3, 4, and 11. An F-test for the zero restrictions 


implied by the second-order model, however, yields an F-ratio of only 1.77, which is 
well below the critical 5 percent point of the x5 distribution. 
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coefficients. To develop a model of the distributed lag response of C; 
to S;, first break down S, into its components: 


Sha Se Se te St pe (7) 


where Si indicates the income gain or loss in quarter ¢ from the 7th 
temporary tax. (In the empirical work, m = 3.) It will help clarify the 
treatment of the Si if I define a hypothetical set of lag coefficients f; as 
the effect on C, of a $1.00 pure windfall gain received in quarter ¢ — 7. 

The treatment of S} depends on whether or not the 7th temporary 
tax is still in effect. If it is, I assume that Si is treated as a weighted 
average of regular and windfall income, so that it gets the distributed 
lag weights 
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If the temporary tax is no longer on the books, I assume that consum- 
ers look upon Si_, in retrospect as if it had been a pure windfall and so 
apply the distributed lag coefficients 8;. By introducing a dummy 
variable defined as: 

Di = 1 if the 7th temporary tax remains in force in quarter f¢,_ 

= (0 otherwise, 


it is possible to combine these two hypotheses into a single expression: 
yi(t) = Dildw; + (1 — d)Bi] + (1 — DDB; (9) 


where the notation now indicates that the y weights depend both on 
calendar time and on the specific tax under consideration (because of 
the dummy variable). 

An interesting point arises here. Standard pre-rational-expec- 
tations approaches to consumption-function estimation would sug- 
gest that 48; and Xy; be constrained to equal 2w;, apparently 
meaning that the “long-run MPC” out of any type of income is 
identical. However, the PIH-cum-rational-expectations approach 
suggests no such adding-up constraint. To see this, follow Sargent 
(1978, pp. 681-82) in rewriting (2) in the form X; = HX;_, + 
(Sargent’s eq. 8), where 
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and Y, = dX,, where d = (1, 0,..., 0). As Sargent notes (his eq. 9), 
rational expectations implies ,X,,, = H*X;, whence irs = AH®X;. 
Substituting this into (1) gives the following expression for permanent 
income: 


oO 


aH® 
YP=A,+ > —_——— 
ae 2 a J 


which is of the form (4) with coefficients 


eo 


baaige tks 4 d=. 


s=0 


dH? 
(+r) 


The sum of the ); has no obvious interpretation. Thus, in a model 
with several sources of income, there is no particular reason why the 
various sets of distributed lag coefficients should have acommon sum. 

Where, then, does the lifetime budget constraint enter? The answer 
is that (4) implies a unitary lifetime MPC for any values of k and the bj. 
The proof involves some straightforward but tedious algebraic ma- 
nipulations of the difference equations (4) and 


Aga An tng = Cy (10) 


and hence is relegated to Appendix A. 

With these preliminaries out of the way, it is easy to explain the 
estimating equation. If there were no special taxes to worry about, the 
basic empirical model of consumer behavior would be as follows: 


n 
G, — ko aa Betsy ae pa 
j=0 


rs (11) 
PR Vp Ag) ShsAes + Ut, 


J=0 


where r; is the rate of interest, W;, is consumer net worth at the 
beginning of period ¢, and A, is the market value of stock market 
wealth at the beginning of period ¢t. The specific way in which assets 
are entered into the consumption function, including the constraint 
thatks + ky +... + kag = ho, 18 Suggested by the MIT-Penn-SSRC 
(MPS) model and is unimportant to what follows. 

Now consider the separation of disposable income into its two 
components: 


Y,=R;, oes (12) 


The way I have defined the y’s means that (11) is expanded to: 


q 
Cy = ho + RyriYe + ho(Wi — At) + Deleiitig 
j=0 
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n n 
+ > wR + >) OSL (13) 
iO) g=0 
n 
+... + > yPWOSH, + 
j=0 


Substituting (9) and (12) into (13), and rearranging terms, gives: 


q 
Cy = ko ap kari. oi k.(W, om A;) ae oe 


j=0 


+ > wj(Y ej + AX} — S14) (14) 


j=0 
n m ; ; 

+ D’8[d — xi + Di  - Di)St-5| + Ue 
j=0 i=1 


where X? = DiS}, +... + DPS2,,7 = 0, ... 7.1 his is not theactual 
estimating equation because additional income sources were distin- 
guished, because the distributed lag coefficients were constrained in 
several ways, and because corrections were made for both hetero- 
scedasticity and serial correlation in the error term. Details are spelled 
out in Appendix B. Nonetheless (14) is the most useful form for 
interpreting the estimated parameters. The model is nonlinear be- 
cause of the parameter A—the crucial parameter of this study. 

As noted above, theory does not imply that {w; = £6;. To investi- 
gate this further, this adding-up constraint was imposed in (14) and its 
validity tested as follows.!° Let @ denote the likelihood ratio. Then, 
under the assumption of normality, —2 log € = T log (SSR,/SSR) is 
distributed as a x? with r degrees of freedom, where: T = number of 
observations (= 100 throughout this paper), SSR = minimized value 
of the sum of squared residuals in the unconstrained regression (eq. 
[14] in this case), SSR, = minimized value of the sum of squared 
residuals in the constrained regression (obtained by imposing £8 = 
Xw), and r = number of restrictions (= 1 in this case). As reported in 
table 1, row 1, the constraint was rejected at the 10 percent level but 
not at the 5 percent level. There being no persuasive theoretical 
rationale for it, the constraint was dropped. 

Tacitly, however, (14) embodies a number of other constraints that 
are equally lacking in theoretical justification—constraints that each 
type of regular income is subject to the same set of distributed lag 
coefficients. These constraints were tested by a series of likelihood 
ratio tests, which are described in the balance of this section. 


© See Goldfeld and Quandt (1972, PA): 
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TABLE 1 


x? Tests OF CONSTRAINTS 

















Unconstrained Constraint Test 
Model Tested df Statistic 
LE qu4) Sa 1 3.67 
2a BiG iL) v; = w; for all) 3 12.02 
3. hg? (16) dv; = Lu; 1 Approximately 0* 
4. Eq. (18) with tv = lw ; = w; for all 7 3 19.02 
5. Eq. (18) with Sv = Lw Xo; = Dw, 1 12.52 
6. Eq. (18) with tv = Sw NO) ] 132 
7. Eq. (18) with Lv = Sw n= Il ] 2.84 
8. Eq. (18) with =v = Sw =O ] 17.81 
Note.—Critical levels for the x? distribution are: 

df 10% Point 5% Point 1% Point 

1 2.71 3.84 6.63 

3 6.25 7.81 11.34 





*Due to rounding error, the actual computed test statistic was slightly negative. When this constraint was tested in 
the context of eq. (18), it produced a test statistic of 1.32. 


First, “regular” disposable income was disaggregated into its two 
main components—personal income and “regular” personal taxes: 


oe (15) 


Personal income is assumed to be spent according to the lag weights 
w;, while regular taxes are assumed to be subject to a different set of 
lag weights v;. Special taxes are treated as previously explained, ex- 
cept that the v’s replace the w’s in equations (8) and (9). That is, while 
the tax is on the books, a special tax is treated as a weighted average of 
a permanent tax and a windfall. Thus the basic consumption function 
becomes: 


q 


Cia Ng + hil et Wi A > kasd ens 


j=0 


+ >) wiP is S' a(Ti-3 — 9X4) (16) 


j=0 j=0 
+ YA) — xi + DC — DSL] + He 
j=0 i=1 
Since (as explained below) the distributed lag coefficients are con- 


strained to follow a third-degree polynomial with a zero end-point 


‘1 In making this separation, I departed a bit from national income accounting 
conventions by including the employer’s share of social insurance contributions in both 
P and T. 
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constraint, the null hypothesis that the v; are equal to the w; imposes 
three constraints on equation (16). Row 2 of table 1 shows that these 
constraints were resoundingly rejected by the data (x3 = 12). How- 
ever, it turned out that the sum of the v; was estimated to be almost 
exactly equal to the sum of the w; (see row 3 of table 1), so this 
adding-up constraint was imposed in subsequent estimates. 

The final generalization considered was to disaggregate personal 
income into its two main components—factor income and transfer 


payments:” 
P,=F,+V:z. (17) 


This made the estimating equation: 


q 
Cy = ho Fh, ke(Wy — Ay) + Die 
j=0 


n 


+ SwyFes + OV 1-3 — D0;(Ti5 — 2X) (18) 
j=0 j=0 


5=0 
ss >A (1 a) ay > = Di)Si-5|+ Ug. 
IW i=] 


Once again, the null hypothesis that the @; (spending coefficients for 
transfers) are in fact equal to the w; (spending coefficients for factor 
income) was tested by a x? test. And once again it was resoundingly 
rejected (x3 = 19; see row 4 of table 1). This time, however, the data 
also rejected the adding-up constraint £4; = Sw;, which therefore was 
not imposed (table 1, row 5). In fact, most of the difference between 
the ¢; and the w; was in their sums; the time patterns were remarkably 
similar. 

To summarize these tests, we are left with a model that assigns 
distributed lag weights w; to factor income, ¢; to transfers, vu; to 
regular taxes, and 8; to windfalls. The {v; and Sw; are apparently 
equal, but the other sums are not. 


V. Issues in Estimation 
1. Data 


Following the suggestion of Darby (1975), I used consumer ex- 
penditures, rather than pure consumption, as C;. This seems most 
appropriate where the focus is on the evaluation of stabilization pol- 
icy, as it is here, rather than on testing the PIH. Furthermore, it 


_"* For this purpose, both employer contributions and business transfers are con- 
sidered to be factor income, and the aspects of the 1975-76 tax cuts that are classified as 
transfer payments in the national income accounts are grouped with temporary taxes. 
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TABLE 2 


EFFECTS ON DISPOSABLE INCOME oF 1975—76 
‘TAX Cuts AND TRANSFERS* 














TaAxest ‘TRANSFERS+ 
QUARTER Tax Rate Cuts Rebate Social Security ee TOTAL 
G75is2 8.5 S12 6.7 0 46.4 
LOTS 123 0 0 0 12.3 
197534 ES, 0 0 0 11.9 
1976: 1 ie 0 0 IES 16.1 
1976:2 14.0 0 0 1.6 15.6 





* In billions of current dollars, at annual rates. 
7 From U.S. Bureau of Economic Analysis (February 1976, March 1977). 
+ Kindly supplied to the author by Joseph C. Wakefield, of the Bureau of Economic Analysis, in conversation. 


avoids many complicated issues of definition (e.g., Which goods are 
durables? How fast do they depreciate? etc., etc.). The cost of this 
shortcut is that the theoretical interpretation of some of the param- 
eters is lost. For example, k, includes the effects of 7; on both pure 
consumption (which may be positive or negative) and spending on 
durables (which should be negative). Similarly, the lag weights w; 
might be expected to be less smooth than the lag of pure consumption 
behind income because of the lumpy nature of durables.'° 

Data for the 1968 surcharge were taken from Okun (1971) and 
converted to 1972 dollars by the deflator for personal consumer 
expenditures; these comprise S/. Data for the various 1975-76 tax 
cuts are shown in table 2; they were similarly deflated and segregated 
into two time series. The S? series is defined as the explicitly one-shot 
measures: the tax rebates and the social security bonuses (hereafter 
referred to as “the rebate”). The rest is considered as S?. Since the 
1975 cuts were extended several times and are now a permanent 
feature of the tax code, an arbitrary decision had to be made as to 
when they became “permanent.”'* I decided to cut off S? after 
1976:2, because by then the cuts had already been extended once in 
the Revenue Adjustment Act of 1975 and a second time in the Tax 
Reform Act of 1976. 


13 'To account for the special features of expenditures on consumer durables, several 
additional variables were tried in some earlier regressions. Neither the stock of dura- 
bles, nor the relative price of durables, nor the unemployment rate succeeded in 
significantly lowering the sum of squared residuals. In several cases, the signs of the 
coefficients were ‘even the opposite of what theory suggests. It may be that these 
variables are more relevant to the choice between saving in the form of durables vs. in 
financial form than they are to the choice between spending and saving. 

14 At an early stage of this research, I experimented with a learning model in which a 
temporary tax gradually came to be considered permanent as it remained on the books 
longer and longer. This experiment was unsuccessful. 
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Data on consumer net worth (W;), and its breakdown into stock 
market (4;) and non-stock market (W, — A;) components, were taken 
from the data bank of the MPS model and converted to 1972 dollars. 
They are based on a number of primary sources, the most important 
of which is the flow of funds. 

Because of the recent findings of Boskin (1978), I thought it im- 
portant to use a real after-tax interest rate. Since the construction of 
this series was a fairly involved affair, I explain it only in Appendix C. 


2. Distributed Lag Estimation 


The many distributed lags in equation (18) were estimated by an 
adaptation of the Almon (1965) lag technique, as a method of con- 
serving on parameters. Generally, a third-degree polynomial with a 
zero constraint at the far end was used. Preliminary tests suggested 
that these end-point constraints (w+, = 0, etc.) could not be rejected. 
An unconstrained version of one specification, run as an experiment, 
showed that the polynomial constraint had very little effect on the w; 
coefficients but did influence the B; coefficients. 

The distributed lag effects of assets were estimated as follows. In 
some preliminary regressions, the two components were combined 
and a cubic distributed lag over n quarters was estimated. Then the 
two components were disaggregated. These preliminary tests showed 
quite clearly (a) that the coefficients were not the same and (bd) that the 
lag was much shorter than n quarters. (As explained just below, n was 
chosen to be 7.) In the case of non—stock market wealth, an estimated 
distributed lag over 4 quarters attached virtually all of the weight to 
the current (start of period) value, so all lagged values were omitted. 
In the case of stock market wealth, when a cubic was estimated over) 
=0,1,..., 7, the coefficients turned out to be almost linear and to be 
virtually zero after 7 = 2 (a small positive value for j = 3 and small 
negative values forj = 4,..., 7). Thusa linear distributed lag over) = 
0, ..., 3 was adopted as the final specification. 

The length of the distributed lag, n, was selected by running a 
preliminary version of the regression for alternative values of n 
ranging from 6 to 10. A very clear minimum in the sum of squared 
residuals was found around n = 7 orn = 8, with the former having a 
slightly better fit and slightly better coefficients. On this basis, n = 7 
was selected for all subsequent work. 


3. Treatment of the 1975 Rebate 


As has been noted already, the model divides the 1975-76 tax cuts 
into two parts: S? includes the rebate, while $3 includes all the rest. 
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This makes a strong (and questionable) assumption about how con- 
sumers treated the rebate. In particular, it assumes that they treated it 
just like the 1968 surcharge and the other 1975 reductions: essentially 
as if a fraction A of it was a regular increase in income, while a fraction 
1 — dX was a pure windfall. Given the nature of the rebate, this is 
questionable, to say the least. 

An alternative assumption—equally strong as the first—is that con- 
sumers treated the rebate as a pure windfall right from the start.!® 
While it is not obvious that this is true, since consumers might have 
anticipated a repeat performance with some reasonable probability, 
it does seem a plausible working hypothesis. Fortunately, it is not 
difficult to modify any of the three models to accommodate this 
alternative hypothesis; all that is necessary is that the lag weights 8; be 
applied to S% starting immediately in 1975:2. When this was done in 
one version of the model, the resulting equation had virtually an 
identical SSR, almost the same estimated \, and very similar implica- 
tions about spending patterns out of the rebate. Thus the conclusions 
of this study seem insensitive to the treatment of the rebate. 


VI. Empirical Results 
1. Parameter Estimates and Interpretation 


Estimation was done by the numerical optimization package devel- 
oped by S. M. Goldfeld and R. E. Quandt. The results from estimat- 
ing equation (18) on quarterly data covering 1953:1—-1977:4 are 
presented in table 3. The number in parentheses next to each esti- 
mated coefficient is its asymptotic standard error (or rather a numeri- 
cal estimate thereof). 

In interpreting the standard error of the regression, it should be 
mentioned that, as explained in Appendix B, the equation was ac- 
tually transformed so that the left-hand variable was the average 
propensity to consume (APC), C,/Y,, rather than consumer spending. 
Thus, the standard error of .0038 is relative to a typical value for the 
APC of about .90. This represents an excellent fit. At 1977 income 
levels, it translates to a standard error of about $3.5 billion in pre- 
dicting consumer spending. Of course, obtaining a good fit with a 
consumption function is hardly a notable achievement, and the 
equation—like most consumption functions—does suffer from some 


autocorrelation (p = .54). 


15 Since the rebate was off the books by 1975:3, and hence treated as a windfall in any 


case, only 1975:2 is at issue here. ; 
16 The standard errors of a comparable equation in Modigliani and Steindel (1977) 
are .0056 with an autocorrelation correction and .0065 without. 
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TABLE 3 


NONLINEAR CONSUMPTION FUNCTION PARAMETER ESTIMATES* 











ko ky ky xr p 
30.1 .0002 021 50 BOE: 
(7.1) (.0007) (.005) (.32) (.11) 





DISTRIBUTED LAG COEFFICIENTSt 











2 Wj $; U5 Bi Raj y; Rebate 
0 .60 (.05) 50 (.16) .34(.09) —.03 (.26)  .009 (.002) 16 16 
1 .16(.02) —.02(.09) .14(.04) —.03(.11)  .006 (.001) OS = 08} 
2 —.06 (.03) —.20(.10) .04(.05) —.01(.09)  .004 (.001) 10.2 ee OE 
3 —.12(.02) —.14(.08)  .01 (.05) .03 (.09) .002(.0005) .02 .03 
4 —.08 (.01) .04 (.06) — .02 (.03) .08 (.07) sig 05 08 
5 .02 (.01) .26 (.07) — .05 (.03) .11 (.06) <— .08 al 
6 .10 (.02) 39 (.09) .08 (.04) .12 (.08) Bic: .10 ale 
7 mlel(eO 2) 34 (.08) — .07 (.04) .09 (.07) re .08 .09 
Sum 74 eal 74 36 .021 (.005) 20D 55 





Nore.—4p = constant, k, = coefficient of interest rate, kp = coefficient of non-stock market wealth, \ = weight on 
regular income, p = autocorrelation coefficient. Sum of squared residuals = .00147, SE = .00383, SE of unadjusted 
errors (without a correction for autocorrelation) = .00455, N observations = 100. 

* Asymptotic SEs are in parentheses. 

+ Components may not add to totals due to rounding. 


Turning to the coefficient estimates, the most critical parameter for 
purposes of this study is A, the weight attached to regular income in 
equation (8). The point estimate of .50 suggests that temporary taxes 
that are still on the books are treated like 50-50 blends of windfalls 
and regular taxes. However, the standard error is regrettably large; 
there are, after all, pitifully few observations that can be used to 
estimate A. The null hypotheses 4 = 0 or A = 1 can nonetheless be 
tested by likelihood ratio tests. When these tests were run with equa- 
tion (18) as the unconstrained regression (see table 1, rows 6 and 7), 
the null hypothesis that A = 0 (temporary taxes are regarded as pure 
windfalls) could not be rejected (x7 = 1.32). But the null hypothesis 
that \ = 1 (temporary taxes are regarded as regular income) could be 
rejected if we were not too fussy about significance levels (x7 = 2.84). 

I turn next to the distributed lag coefficients of the various income 
terms. The w’s for factor income are very large and positive at first, 
then turn small and negative, and finally become positive again at the 
end. This general shape accords well with our expectations.!” The ¢; 
coefficients for transfer payments follow a similar shape but are much 
more erratic and less well pinned down econometrically. A notable 


‘7 Because of the estimating form, it is unlikely that simultaneity has much to do with 
the large estimate for wo. See Appendix B. 
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feature is that their sum is nearly 1.2, indicating “overspending” 
during the first 2 years after receipt of a transfer payment. 

The v; coefficients for regular taxes also exhibit a characteristic 
U-shape, but in much more muted fashion. As compared with factor 
income, spending in the first year after a regular tax cut is apparently 
substantially less, after which it catches up. This was surprising at first, 
since Dolde (1976) and Modigliani and Steindel (1977) had suggested 
that regular tax changes are “more permanent” than regular in- 
come.!® However, it turns out that the following second-order au- 
toregressives describe the deviations from trend of personal income 
P;) and regular taxes (7;): 


P= 132P,,— 38P.,, 
(.09) (.09) 

T; = An pieeey os Elia: 
(.09) (.10) 


Using the formulas derived earlier for permanent income, these 
time-series models imply that “permanent personal income” and 
“permanent taxes” are given by P? = 14.9P, — 5.7P;_, and T? = 9.37; 
— 1.97;_,, so that, contrary to Dolde and Modigliani and Steindel, we 
should actually expect a stronger short-run response of consumption 
to fluctuations in P, than to fluctuations in T;, which is exactly what I 
find. 

The next column, the £;’s, are in some sense out-of-sample ex- 
trapolations since there are no “pure windfalls” recorded in the 
data.'® Their only use is to form the weighted average Aw; + (1 — A)B;, 
which is reported in the column marked “y;.” These are the ex- 
penditure coefficients for income from a temporary tax that remains 
on the books for the entire 2-year horizon. To illustrate the opposite 
extreme, the column marked “Rebate” shows the spending 
coefficients for a temporary tax that lasts only 1 quarter. These two 
columns differ in details but are quite similar. There is a moderate 
spending response in the initial quarter, followed by very little 
spending over the next 3 or 4 quarters. Most of the spending out of a 
temporary tax cut, according to these estimates, comes 5 or more 
quarters after the cut. 

The coefficient of assets (.02) is comparable to what others have 
estimated, though a bit on the low side. A likelihood ratio test of the 
null hypothesis kg = 0 (which, in this constrained form, also implies k3 


18 But see Dolde (1979), where the transitory nature of allegedly permanent tax 


changes is stressed. 
19] made an attempt, in some early regressions, to treat the National Service Life 


Insurance Dividends of 1950 in this way, but I was not successful. 
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=... = ks = 0) produced a test statistic of 17.8, which is highly 
significant at any reasonable significance level (table 1, row 8). 

If we ignore the fact that income from property is included in the 
measure of income, the parameter ky can be given an interesting 
theoretical interpretation. In the basic life-cycle model, the consumer 
maximizes a utility function of the form 


Pe ee 
ran ts 


subject to a lifetime wealth constraint. It can be shown by straightfor- 
ward computations that the optimal solution for initial consumption Is 
Co = [r — g)/(1 + r)IW, where W is lifetime wealth and g is the optimal 
growth rate of C,, defined as g = [(1 + r)/(1 + p)}® — 1. This means 
that k, corresponds to the theoretical coefficient (r — g)/( +7), which 
for small values of r and p is approximately equal to r + (1/5)(p — r). 
Thus for small values of r the estimated value k, = .02 implies that the 
subjective discount rate, p, is approximately .026 or around 2-3 
percent per quarter for plausible values of 6. 

One striking result, though it is peripheral to the subject of this 
study, is the tiny coefficient of the real after-tax interest rate.*? This 
finding turned up in every specification of the model, including 
several alternative measures of the rate of interest. (Sometimes the 
coefficient was trivially negative, sometimes trivially positive, but al- 
ways trivial.) While it accords well both with my earlier work (Blinder 
1975) and with the work of others, it stands in sharp contrast with 
Boskin’s (1978) recent finding of a strong positive interest elasticity of 
saving. 


2. Temporary versus Permanent Taxes 


We can now address the principal issue of this study: How effective 
are explicitly temporary income tax changes as compared to those 
announced to be permanent? Table 4 contains the answers derived 
from the model, using the parameter estimates presented in table 3 to 
make equation (4) operational and using an annual real interest rate 
of 3 percent in updating wealth according to equation (10). It can be 
seen from column 4 that a temporary tax is about one-half as effective 
as a permanent tax in the first year, rising to about three-quarters as 
effective in the second year. Spending out of a rebate is somewhat 
slower than this. My estimated cumulative spending propensities out 


20 The specific coefficient in table 3 means that a 1 percentage point rise in r lowers 
savings by about .02 of 1 percent of disposable income—a trivial amount. 
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TABLE 4 


RELATIVE EFFECTIVENESS OF TEMPORARY TAXES 














CUMULATIVE SPENDING PROPENSITIES RATIOS 
Permanent 2-Year Rebate (2)/(1) (3)/(1) 
J (1) (2) (3) (4) (5) 
0 34 .16 16 A7 A7 
1 -50 oo 14 .46 28 
2 -55 26 a6 cA .29 
3 56 30 mall 54 38 
4 .59 36 .30 61 51 
5 .65 46 43 ak .66 
6 TS 57 .56 .78 Bad 
7 81 .65 .66 .80 81 





of a rebate are larger than those estimated by Modigliani and Steindel 
(1977) for the first few quarters but smaller thereafter. 

These findings carry two important messages to fiscal policy plan- 
ners. First, and most obvious, is that temporary taxes are less powerful 
devices for short-run stabilization purposes than are permanent ones. 
Second, and perhaps almost as important, the short-run relative inef- 
fectiveness of such taxes implies that the impact of these measures in 
the second year is larger than might be expected. For example, 
according to table 4, each $1.00 of permanent tax reduction adds 
$0.25 to spending in the second year, while each $1.00 of a rebate 
adds $0.45. If the need is for a truly short-run stimulus to aggregate 
demand, this effect may also be unwanted. 

Both of these points can be illustrated by examining what the 
equations have to say about the 1975-76 episode. First, it is useful to 
display the observed APCs for this period in table 5. There are two 
obvious phenomena crying out for explanation in these data. First, 
why did the APC drop so sharply in 1975:2? Second, why did it 
thereafter begin a steady climb to what is a truly extraordinary level 
by 1977:1? (The corresponding personal savings rate was only 4.2 


TMAlbs, 5 


AVERAGE PROPENSITIES TO CONSUME, 1975-77 











1975 1976 1977 
First quarter 913 " 914 .935 
Second quarter .881 .918 .926 
Third quarter .903 .922 .922 
Fourth quarter .907 .926 .925 





Source.—U.S. Bureau of Economic Analysis (various issues). 
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TABLE 6 


EsTiMATED EFFECTS OF THE 1975-76 TEMPORARY 
Tax Cuts ON CONSUMER EXPENDITURES* 











Quarter Estimated Spending Effect 
NO oer Oe 
OAD 1.6 
MOOR all 
OO 4.8 
1976:2 6.7 
1976:3 6.7 
NOOR: Tel 
MS fata 6.6 
MOT ez 
OM Taste eo 
NO Tifa: igs 





* In billions of 1972 dollars. 


percent—the lowest figure that had then been recorded since the 
Korean War.) 

According to the estimates presented in this paper, the temporary 
tax cuts of 1975-76 contributed to both phenomena. Using the 
spending coefficients presented in table 4, table 6 shows the estimated 
direct (excluding multiplier) effects on consumer spending of the tax 
cuts of 1975:2 through 1976: 2, inclusive.”! It appears that (1) very 
little of the rebate was spent in 1975: 2, (2) rather little of the disposa- 
ble income attributable to the temporary tax cut package was spent 
during the remainder of 1975, and (3) more spending out of the 
temporary tax cuts was done in 1976 and yet more in 1977. Both the 
low APC of 1975: 2 and the high APCs of late 1976 and early 1977 are 
tracked very well by the model. One observation which virtually 
jumps from table 6 is how very small these estimated spending im- 
pacts are relative to the size of the economy they were meant to 
stimulate (real GNP in the neighborhood of $1,250 billion). 

Finally, there is one more question. If, instead of the 1975:2- 
1976:2 package of temporary measures, the government had cut 
taxes “permanently” in 1975:2 and then restored them to their origi- 
nal level starting in 1976:3, how large a tax cut would have achieved 
the same average effect on aggregate demand?” 

Table 7 summarizes the model’s answers to this question for three 
different choices of the horizon over which the “average effect on 
aggregate demand” might be defined. The first column gives the 
average direct impact on spending attributed by the model to the 

*! The reader is reminded that only these 5 quarters are considered temporary cuts. 


” For this calculation I assume that consumers were successfully fooled into thinking 
that the 5-quarter tax cut would be permanent. 


TEMPORARY INCOME TAXES AW 


SWAN Ee 


EQUIVALENT PERMANENT TAXES**+ 











Average Impact on Cumulative Revenue Loss over 
Spending of Actual 5 Quarters of Permanent Tax Cut 
Horizon 1975-76 Cuts with Equal Average Impact on Spending 
4 quarters Sel) Om 
6 quarters ad 12.4 
8 quarters Did 15.9 





* See text for definition. 
7 In billions of 1972 dollars, at annual rates. 


1975-76 tax cuts. The next column shows how much total tax reve- 
nue the government would have had to relinquish during the same 
5-quarter period (1975 : 2—1976: 2) in order to achieve the same direct 
impact on spending through a permanent tax cut. Since the total 
5-quarter revenue loss from the 1975-76 package was $20 billion, 
these numbers mean, for example, that a permanent tax cut about 
half as large ($9.5 billion vs. $20 billion) would have had the same 
first-year effect on aggregate demand. Over a 2-year horizon, how- 
ever, the 1975-76 package had about 80 percent as much “bang for 
the buck” as a permanent tax cut. 


VII. Summary 


Both economic theory and casual empirical observation of the U.S. 
economy suggest that short-run spending propensities from tempo- 
rary tax changes are smaller than those from permanent ones, but 
neither provides much guidance about the magnitude of this differ- 
ence. This paper offers new empirical estimates of this difference and 
finds it to be quite substantial. 

The analysis is based on an amendment of the standard distributed 
lag version of the PIH that distinguishes temporary taxes from other 
income on the grounds that the latter is “more transitory.” This 
amendment, which is broadly consistent with rational expectations, 
leads to a nonlinear consumption function. 

Though the standard error is unavoidably large, the point estimate 
suggests that a temporary tax change is treated as a 50-50 blend of a 
normal income tax change and a pure windfall. Over a 1-year plan- 
ning horizon, a temporary tax change is estimated to have only a little 
more than half as much impact as a permanent tax change of equal 
magnitude, and a rebate is estimated to have only about 38 percent as 
much impact. The model tracks both the extraordinarily high savings 
rate of 1975:2 and the extraordinarily low savings rates of late 1976 
and early 1977 very well and attributes part of both phenomena to the 
temporary tax measures of 1975-76. Finally, it is estimated that a 


48 JOURNAL OF POLITICAL ECONOMY 


permanent tax cut of about $9.5 billion (in 1972 dollars) would have 
had the same impact on aggregate demand over the first 4 quarters as 
the $20 billion of 1975—76 tax cuts. 


Appendix A 
The Lifetime Budget Constraint 


This Appendix demonstrates a result that, to my knowledge, is not very well 
known: that the long-run MPC corresponding to a consumption function of 
the form 


Cp= kAy + > Yes (Al) 
j=0 


is unity for any value of k (greater than the real rate of interest) and for any 
b’s. 
Proof: Write equation (10) in the text as 
[RS Ue ble en Ges, (A2) 
and write (Al) as 
C, = kA, + D(L)Yi, (A3) 
where L is the lag operator and b(L) = by + dL +... + OnL”. Applying the 
operator 1 — (1 + r)L to (A3) and using (A2) yields [1 — (1 +r — R)LIC; = {kL 
+ [1 — (1 + r)L]b(L)}Y,, which can be written 
C, = B(L)Y:, (A4) 
where 
kL + (1 — (1 + r)L]0(L) 


BP) eee a1 OL 


(A5) 


To obtain the lifetime spending generated by a $1.00 impulse in Y;, we must 
compute the discounted sum of coefficients: 


= \ ab ie ——— 
MPC py 7a (A6) 


To simplify the notation, let 9 = 1 + r — k. Assuming that 6 < 1 (1e., thatk > 
r), (A5) can be written: B(L) = {kE + [1 — (1 +r)LJO(L)}(1 + OL + 6717 + ...). 
This is of the form B(L) = By + B,L + BLL? + ..., with the following 
coefficients: 


By — 0, 
15 = Wy ae Wo se 2 (Gl sk Dalle 
By = 67by + O16, +k — (1 + r)bol + [b, = (1 + r)bi), 


1 My ae OO ee — (Ol esol ae 6 5 5 ce a, — (Al Se AY alk 
Baw = Oued G == O"[b, +7 k ar ( a r)bo] tae iaats Old, ae (1 aa 7) Oerll 
, qd a m0, 
Brsiss = OBrsi, = 12 
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Substitution of all of these into (A6), and some truly horrendous grinding, 
establishes that MPC = 1 regardless of the magnitudes of k and the b, (as long 
aS he 20). (ED), 

A brief word on the interpretation of the sum of the 0; in (A1) may be in 
order here. Should Y, rise permanently by $1.00—a statement that is basically 
meaningless if the autoregressive process assumed in the text (eq. [2]) really 
holds—the eventual change in C, would, by(A4) be BU) =3,2,B7. According 
to (A5), this sum is 


k-r >, ; 
i=0 


Bl) = = Ee = > *b;). 
(1) aa ers ore >, 5} 


9—0 





Thus 26; controls the size of the spending response to a hypothetical perma- 
nent rise in income; it does not influence the lifetime MPC. 


Appendix B 


Details on the Estimating Equation 


This Appendix derives and explains the equation that was actually estimated. 
I begin by repeating equation (14) of the text: 


q 
Cy = ho + hire¥s + holWi— A) + > ha esAe-s 
j=0 


ns S ws(Yes SNe She > 4) ( = A)X} = Ss, (i= DYSt.| “t by, 
=0 ad - 


For purposes of reducing heteroscedasticity, the assumption was made that 
the standard deviation of u,; was proportional to Y;, so the whole equation was 
divided through by Y, to get 





q 

Aes 

APC.= a + kur, + by{ Oe ca * ie Y, 
t t t j=0 


ia S wiles Sander >a,|a — d)x} + oa — Dis. + &, 
j=0 j=0 i=1 


i= 


(BI) 


where 
Qeay= 1%; (mote: z= 1 tor allt), 
Sve 
Si ai cn: 


= esha 
Gti = St-j — AX1, 


Ut 


Cie ae 
Y, 
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To estimate (B1), the assumption was made that both w; and B; follow 
third-degree polynomials in 7: \ 


WwW; = dy) t+ ay + ag” + ay’, (B2) 
B; = bo + by + bay? + ba’. 
The end-point constraints mentioned in the text (wy+; = Bn+1 = 0) are thus 
Ao + (n + 1)a, + (n + 1)'aq + (n + 1)?a3 = 0, 
by + (n + 1)b, + (n + 1)*by + (n + 173 = O. 


(B3) 


Equations (B3) were used to eliminate the parameters ad) and do. ‘The 
adding-up constraint discussed (and rejected) in the text was thus 


S’failj — (n+ WN + aalj? — Ge + 11 F aol? — (n+ VT}, 
i=0 (B4) 
= S{bilj — (n+ D) + bal? — (w+ VA] + bal? — (+ 5D, 
7=0 
which was used to eliminate the parameter 4. 
Finally, in estimating (B1), €; was assumed to follow a first-order autore- 
gressive scheme, € = pé-1 + &, where & is white noise. Estimation was by 


nonlinear least squares, which is equivalent to maximum likelihood if e is 
normally distributed. The function actually minimized was 


F 


> [arc lee he gee = =) 
f=1 Y; Y, 
q A ¥y n 
a D hes Y, oe Dt c qt-i) 


Yala ee = Dist] - pecs} 
j=0 


i=1 





with all the above-mentioned definitions and parameter restrictions substi- 
tuted in. 

The estimating forms when regular income was further disaggregated were 
derived in precisely analogous ways from equations (16) and (18). 


Appendix C 


Calculation of the Real After-Tax Interest Rate 


The real after-tax interest rate is defined as 7; = i;(1 — tT) — 7, where 2; is the 


nominal interest rate, 7; is the marginal tax rate, and 7; is the expected rate of 
inflation. 


Nominal interest rate. —Four different nominal interest rates were tried: a 
corporate bond rate, the 4-6 month commercial paper rate, the 3-month 
Treasury bill rate, and a weighted average of rates received on various time 
and savings accounts. While all four led to very similar results, the Treasury 
bill rate gave the best fit and so was adopted. 

Marginal tax rate. —The marginal tax rate was created from the average tax 
rate in the following way. Let T(y) be the tax function facing an individual 
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taxpayer, and let f(y) be the frequency distribution of income. The only 
directly observable tax rate is the aggregate average rate, which is 4 = 
TOO ay Vi fof ay]. 

Now suppose the whole income distribution shifts to the right with no 
change in its shape; that is, it shifts from f(y) tofLy(1 — h)], where h connotes a 
“small” multiplicative shift. The average income is then Y(h) = fyftyd = 
h)|dy, so that, for small shifts (h near zero), d¥ /dh = — Jy? )dy. Similarly, 
average tax payments after the shift are T(h) = {TO)flyd — h)]dy, so that 
dT /dh = — SoTO)f'()dy. The aggregate marginal tax rate, M, is the ratio of 
these: M = [fyT(y)f'(y)dy Mf yf (yay. 

To evaluate these integrals and obtain a closed expression for M/A, I 
adopted the following functional forms: 


T(y) = ay? (o> o>), 
IO) =e” y= 0): 


With these assumptions, the two ratios of integrals work out to be A = 
@yly*)T(b + 1), M = (ay?/2y"*?)P(b + 2), where P'(n) is the “gamma 
function,” namely, [(n + 1) = nI'(n). Hence M/A = (b + 1)/2, and using a tax 
elasticity of b = 1.6 gives M = (1.3)A. 

The average tax rate was computed, quarter by quarter, by dividing the 
sum of federal and state-local personal tax and nontax payments by personal 
income excluding transfers (an approximation to the tax base). These are all 
official national income accounts series. 

Expected rate of inflakon.—Inflationary expectations were generated by a 
model based on what has been called “economically rational” expectations.”8 
The idea is that agents, in informing themselves, begin with the data that are 
cheapest per unit of informational content and then proceed to process more 
costly data until the marginal cost and (expected) marginal benefits are 
equated. In this particular application, I assumed that consumers base their 
expectations of the inflation rate (P;) on its own past history and on the history 


of the growth rate of the money supply (M,). Thus I estimated an equation 
I 


J 
Py =a) + > asPry+ > biMia + e 
j=1 i=0 


on actual U.S. quarterly data, using the deflator for personal consumption 
expenditures for P, and M, for M,, and assumed that consumers used this 
equation to generate expectations. In estimation, I used the Almon (1965) lag 
technique with third-degree polynomials, no end-point constraints, and vari- 
ous choices for J and/. The best results were obtained with J = 11 and/ = 17, 
namely 


P; een 
v 
standard error = 1.42, mean of dependent variable = 3.35, 


La; = .67 Db; a 
(.10) (.09) 


11 Miz 
60+ > aPi,+ > bM-i,R? =.73, D-W= 1.98, 
2S) at 0 


#3 See Feige and Pearce (1976). 

*4 Standard errors are in parentheses. The D-W is the Durbin-Watson statistic. The 
period of estimation was 1951:3-1977:4, the longest period possible given the need for 
17 lagged values of M. 


52 JOURNAL OF POLITICAL ECONOMY 


References 


Almon, Shirley. “The Distributed Lag between Capital Appropriations and 
Expenditures.” Econometrica 33 (January 1965): 178-96. 

Blinder, Alan S. “Distribution Effects and the Aggregate Consumption Func- 
tion. sb.) 60, No.3 (june 19/5). 427-715, 

. “Intergenerational Transfers and Life Cycle Consumption.” 4.E.R. 
Papers and Proc. 66 (May 1976): 87-93. 

Blinder, Alan S., and Solow, Robert M. “Analytical Foundations of Fiscal 
Policy.” In The Economics of Public Finance, by Alan S. Blinder and others. 
Washington: Brookings Inst., 1974. 

Boskin, Michael J. “Taxation, Saving, and the Rate of Interest.” J.P.E. 86, no. 
2, pt. 2 (April 1978): S3-S27. 

Darby, Michael R. “The Allocation of Transitory Income among Consumers’ 
Assets.” 4.E.R. 62 (December 1972): 928-41. 

. “Postwar U.S. Consumption, Consumer Expenditures, and Saving.” 
A.E.R. Papers and Proc. 65 (May 1975): Ween 

Dolde, Walter. “Forecasting the Consumption Effects of Stabilization 
Policies.” Internat. Econ. Rev. 17 (June 1976): 431-46. 

. “Capital Markets and the Short Run Behavior of Life Cycle Savers.” J. 

Finance 33 (May 1978): 413-28. 

. “Temporary Taxes as Macro-economic Stabilizers.” A.E.R. Papers and 
Proc. 69 (May 1979): 81-85. 

Eisner, Robert. “Fiscal and Monetary Policy Reconsidered.” 4.E.R. 59 (De- 
cember 1969): 897-905. 

Feige, Edgar L., and Pearce, Douglas K. “Economically Rational Expecta- 
tions: Are Innovations in the Rate of Inflation Independent of Innovations 
in Measures of Monetary and Fiscal Policy?” J.P.E. 84, no. 3 (June 1976): 
499—522. 

Foley, Duncan K., and Hellwig, Martin F. “Asset Management with Trading 
Uncertainty.” Rev. Econ. Studies 42 (July 1975): 327-46. 

Friedman, Milton. A Theory of the Consumption Function. Princeton, N.J.: 
Princeton Univ. Press (for Nat. Bur. Econ. Res.), 1957. 

Goldfeld, Stephen M., and Quandt, Richard E. Nonlinear Methods in Economet- 
rics. Amsterdam: North-Holland, 1972. 

Hall, Robert E. “Stochastic Implications of the Life Cycle-Permanent Income 
iia Theory and Evidence.” J.P.E. 86, no. 6 (December 1978): 
971-87. 

Houthakker, Hendrik S., and Taylor, Lester D. Consumer Demand in the United 
States. Cambridge, Mass.: Harvard Univ. Press, 1966. 

Juster, F. Thomas. “A Note on Prospective 1977 Tax-Cuts and Consumer 
Spending.” Unpublished paper, Univ. Michigan, January 1977. 

Lucas, Robert E., Jr. “Econometric Policy Evaluation: A Critique.” In The 
Phillips Curve and Labor Markets, edited by Karl Brunner and Allan H. 
Meltzer. Amsterdam: North-Holland, 1976. 

Modigliani, Franco, and Brumberg, Richard E. “Utility Analysis and the 
Consumption Function: An Interpretation of Cross-Section Data.” In 
Post-Keynesian Economics, edited by Kenneth K. Kurihara. New Brunswick, 
N.J.: Rutgers Univ. Press, 1954. 

Modigliani, Franco, and Steindel, Charles. “Is a Tax Rebate an Effective Tool 
a Stabilization Policy?” Brookings Papers Econ. Activity, no. 1 (1977), pp. 
175-203. 














TEMPORARY INCOME TAXES 53 


Muth, John F. “Optimal Properties of Exponentially Weighted Forecasts.” J. 
American Statis. Assoc. 55 (June 1960): 299-306. 

Okun, Arthur M. “The Personal Tax Surcharge and Consumer Demand, 
1968-70.” Brookings Papers Econ. Activity, no. 1 (1971), pp. 167-211. 

Sargent, Thomas J. “Rational Expectations, Econometric Exogeneity, and 
Consumption.” J.P.E. 86, no. 4 (August 1978): 673-700. 

Springer, William L. “Did the 1968 Surcharge Really Work?” 4.E.R. 65 
(September 1975): 644-59. 

U.S. Bureau of Economic Analysis. Survey of Current Business. Washington: 
Government Printing Office, various issues. 


Consumer Search with 
Uncertain Product Quality 





John D. Hey 
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Chris J. McKenna 
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This paper presents a unified model of consumer search when both 
prices and qualities are uncertain. It is assumed that price can be 
observed prior to purchase but that quality can be observed only 
after purchase and experience. This two-stage decision problem is of 
particular interest since, in practice, there 1s usually some connection 
between quality and price. Of the many interesting implications of 
this dependence explored in this paper is the result that the usual 
“reservation-price rule” may no longer be optimal; indeed, the op- 
timal stopping set may take one of several alternative (and more 
realistic) forms. 


I. Introduction 


The economics of search has made considerable progress in recent 
years in describing and explaining the actions of economic agents 
operating in uncertain environments. Our concern in this paper is 
with the application of this large body of knowledge to the consumer- 
search problem. In its simplest form this problem examines the be- 
havior of a consumer searching (at a constant cost per observation) 
over a known distribution of prices for the lowest (economically 
sensible) price for the purchase of one unit of some good. The 


We would like to thank an anonymous referee and George J. Stigler for some very 
helpful comments on two earlier drafts of this paper. We are also grateful to Louis L. 
Wilde for useful comments on the original draft. 
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solution, which is now well known, takes the form of a stopping rule 
uniquely determined by a “reservation price”; that is, the consumer 
behaves optimally by searching until a price lower than the reserva- 
tion price is obtained. 

In this simple form, and indeed in many of its more sophisticated 
generalizations, a crucial assumption is that search takes place over 
just one characteristic of the good—namely, its price. However, in 
practice, a good has several dimensions (of which price is but one); 
moreover, uncertainty surrounds not just the price dimension but 
these other dimensions as well. Of course, this observation would be 
trivial if information on these other aspects could be collected at the 
same time as information about price. However, in practice, some 
characteristics inevitably remain undiscovered until after purchase; 
thus, prior to purchase, residual uncertainty remains. 

The purpose of this paper is to explore the consumer-search prob- 
lem in this multidimensional form. In particular, we partition the set 
of characteristics into those which can be observed prior to purchase 
and those which can be observed only after purchase. To keep our 
exposition simple, we consider the simplest of all possible formula- 
tions: a two-dimensional case in which price is the dimension which 
can be observed prior to purchase and quality is the dimension observ- 
able only on purchase and experience. 

Our extension makes the consumer’s problem a two-stage one: The 
first stage relates to the price dimension—the consumer must decide 
what prices are “acceptable”; the second stage relates to the quality 
dimension—the consumer must decide what qualities are “accept- 
able.” What makes the problem particularly interesting is that, in 
practice, these two dimensions—price and quality—tend to be re- 
lated. This dependence means that the two stages of the consumer’s 
problem are also related. Moreover, this dependence implies, as we 
shall show, that the optimal stopping rule at the first stage may well no 
longer be of the reservation-price form. For example, it may well be 
optimal for the consumer to search for expensive (rather than cheap) 
goods; alternatively, it could be optimal for the consumer to search 
for goods in some middle price range. Indeed, a variety of possible 
stopping rules may apply at the first stage; as we shall show, the form 
depends crucially on the form of the dependence between quality and 
price. 

This two-stage problem belongs to a class of similar problems now 
beginning to appear in the literature. A similar consumer-search 
problem has been developed independently by Wilde (1977). In many 
respects his model is similar to ours, though it proceeds at a somewhat 
higher level of generality. Wilde’s work follows from an earlier paper 
(Wilde 1979) on the parallel problem in the job-search area (a prob- 
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lem in which the wage is observed prior to accepting a job offer, and 
the nonpecuniary job characteristics are observed after experiencing 
the job). This latter problem was studied independently, and in a 
slightly different form, by McKenna (1978). Work has also been 
carried out in this area by Lippman and McCall (in press). 

The paper is organized as follows: In the next section we introduce 
our notation and detail the assumptions of our model; the following 
section finds the solution to the consumer’s two-stage problem and 
examines the properties of this solution; a concluding section reviews 
and appraises the key results. 


II. The Model 


We assume that the consumer-searcher is interested in two charac- 
teristics of the good—quality and price. We denote by Q the quality of 
the good as measured by the per period money value of the product’s 
quality; we assume that the value of Q for a particular good can be 
observed only after purchase. The good yields Q for each period of its 
n-period lifetime; n is assumed to be the same for all goods. We 
denote by P the price of the good as measured by the price per unit 
per period of its lifetime (i.e., the per period “user cost” of the good).’ 

The consumer makes independent random selections from the 
joint distribution of P and Q; each drawing costs a (constant) amount c 
(> 0). The value of P is immediately revealed, while the value of Q 1s 
revealed only if purchase takes place. We assume that the searcher 
knows the joint distribution of P and Q over all goods: We denote by 
g(-) the marginal probability density function (pdf) of P, and by 
fe | p) the conditional pdf of Q given p; we use G(-) and F(- | p) to 
denote the corresponding distribution functions and permit P and Q 
to take values on the positive real line. 

‘The consumer faces an infinite horizon, discounts all values at the 
rate p (0 < p < 1), and wishes to have exactly one unit of the good in 
each period. Given these assumptions, the lifetime discounted value 
to the consumer is 


DPQ — Pr Si), (1) 


where Q, and P, are the quality and price of the good being consumed 
in period ¢, and S, is the amount spent on search (possibly zero) in 
period ¢. 


The individual’s objective is to find, and follow, the strategy which 


* Alternatively, the purchase price—payable at the time of purchase—is L7_,p'!P 
(where p, see below, is the discount rate). 
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maximizes the expected value of the expression (1). The model is built 
around the following assumed sequence of events: 


1. The consumer searches, at a cost c per search, over 
values of P until an acceptable value of P is obtained. We 
assume that this search process is instantaneous and that no 
passage of time occurs.” Thus the consumer can obtain in- 
stantaneously n “quotes” at a total cost nc, whether the 
searching is done sequentially or not. 

2. The consumer then purchases one unit of the good. 

3. After purchase, the consumer experiences the good and 
thereby discovers the value of Q associated with the good. 
(We assume that all uncertainties are resolved after pur- 
chase.) 

4. Having discovered Q, the consumer decides whether to 
keep the good for the whole of its n-period lifetime or to 
dispose of the good (at zero cost) at some point within its 
lifetime. If the latter option is used, the individual returns to 
stage 1. 

5. If the good is kept until the end of its lifetime, the 
consumer must decide whether Q was sufficiently good for it 
to be repurchased. If so, the good is continually repurchased 
thereafter; otherwise, search continues again at stage 1. 


The model is appropriate for regularly purchased nondurable 
goods for which the amount consumed per period is roughly con- 
stant, such as bread, cereals, eggs, washing powder, newspapers, and 
toilet paper. However, the purchase of some durable items such as 
televisions, cars, and washing machines may also be described in our 
framework.? 


lil. The Solution 


The consumer’s problem is a two-stage one: The first stage relates to 
the price dimension, the second stage to the quality dimension. Our 
solution similarly proceeds in two stages. 

We begin with the first stage (that described by 1 in the sequence of 
events above), namely, the stage relating to the price dimension. The 
consumer’s problem at this stage is to decide which price quotes are 


* This is crucial in'view of our earlier assumption that the consumer requires exactly 
one unit of the good per period. In contrast, Wilde (1977) postulates a time-consuming 
search process and assumes that the consumer derives no utility in periods when he 
does not have a unit of the good and derives utility U(P, Q) in other periods. 

* The repurchase of durable goods, however, may be made after a much longer 
period of consumption. For our model to be valid in describing purchase decisions of 
this type requires that we assume unchanging market conditions over long periods. 
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acceptable (in that purchase should take place at such quotes if ob- 
tained) and which price quotes are unacceptable (in that search 
should continue). In other words, the consumer must decide on the 
optimal acceptance set for prices. To find this optimal set, we first 
begin with an arbitrary acceptance set which we denote by ? (C #°). 
This set ? defines a stopping rule applicable at stage 1 as follows: 


if P € Y, stop searching and purchase good at P; 2) 


tied. Cs Y, continue searching. 


Here P denotes the latest price quote obtained. We wish to find the 
best such set, which we denote by #*, in the sense that it maximizes 
the expected value of the objective function (1). To find this A* we 
proceed as follows. Consider the consumer starting from a position in 
which no acceptable price has yet been obtained; suppose that the 
consumer uses the arbitrary acceptance set Y with respect to the first 
purchase decision but thereafter behaves optimally with respect to all 
future decisions (including those relating to any future return to stage 
1). Let V(P) denote the expected value of the objective function 
following this sequence of events. Clearly A*, by definition, is a set for 
which V (P) attains its maximum value; if we denote this latter by V*, 
we have 


V®2V@P) for all PE 22", 
VV (Fs). 


(3) 


We will demonstrate that P* need not be determined by a unique 
reservation price (i.e., 2* need not be of the form {P; P = P*}). This 
is because the direct attraction of low prices may be more than offset 
by the indirect detraction of associated low qualities. In mathematical 
terminology, the return function from accepting P need not be 
monotonically decreasing in P because of the relationship between Q 
and P in the joint distribution. We pursue this formally below. 

We now turn to the second stage (that described by 4 and 5 in the 
sequence of events above), namely, the stage relating to the quality 
dimension. The consumer’s problem at this stage is to decide what 
qualities are acceptable, both in terms of keeping the good until the 
end of its lifetime and in terms of repurchasing that good at the end 
of its lifetime. In contrast with the decision at stage 1, these acceptance 
decisions do have the reservation property—the higher the Q, the 
better off the individual. 


“In contrast with the first stage where a higher P may make the individual better or 


worse off. For a formal demonstration of the textual assertion see Hey (1979) and/or 
Theorem 1 of Wilde (1979). 
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We wish to determine the optimal reservation qualities; to find 
these we first begin with arbitrary reservation levels defined as fol- 
lows. Let q; denote an arbitrary reservation quality used at the end of 
period 7 following the purchase of the good (¢ = 1,...,n — 1).° Using 
these arbitrary values, the decision at stage 4 is as follows: 

if Q < qj, dispose of the good at the end of period i (if not 
already disposed of) and return to stage 1; 
if Q = qi, keep the good (at least until the end of period 
f4:1), 
Here Q denotes the quality of the good purchased. 

Further, let g, denote an arbitrary reservation quality used at the 
end of period n (at the end of the life of the good). Using this 
arbitrary value, the decision at stage 5 is as follows: 


if Q < qn, do not repurchase the good; instead return to 
stage 1; 


(5) 


if Q => gn, continue repurchasing that good forever. 


Although not essential, particularly for arbitrary qj, it would seem 


appropriate to assume that g; = qi (@@ = 1,..., — 1). 
To find the consumer’s optimal strategy, we need to find the best set 
of gi (¢ = 1,...,m), which we denote by q¥ (¢ = 1, ... , m), in the sense 


that this set maximizes the expected value of the objective function 
(1). To find these g¥ we proceed as follows. Consider the consumer 
starting from a position in which a purchase at price P has just been 
made. Suppose that the consumer uses arbitrary reservation qualities 
gi (@ = 1,...,n) with respect to this purchase but thereafter behaves 
optimally with respect to all future decisions (including those relating 
to any future returns to stages 4 and 5). Let W(P; q,,... , qn) denote 
the expected value of the objective function following this sequence of 
events. Clearly, g# (i = 1,...,) is a set for which W(P; q;, ..- , dn) 
attains its maximum value (for given P); if we denote this latter by 
W*(P), we have 

Wa ea ge, g,) for allg; CAG = 1,.> 1), (6) 
W*(P) = W(P; qi, .- - > Qn): 


This completes our notational preliminaries, and we can now proceed 
to an investigation of A* and the q?. 
If the stopping set f is used at stage 1, we have 


V(P) = -—c + V(PN — fag (p)dp] + [,W*(p)g(p)dp. (7) 


5 We exclude 7 = 0 on grounds of realism; its inclusion would not materially affect 
any of the results. 
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This equation states that, starting from a position in which no accept- 
able price quote has been found, the individual searches once more at 
cost c and either receives an unacceptable quote (returning to “square 
one”) or receives an acceptable quote with expected value of W*(p).® 
Equation (7) gives: 
V(P) = [-c + J,W*(p)e(p)dpV/J,g(p ap. (8) 
Now, it can easily be shown (Hey 1979) that an optimal set is given by 
P* = {P; W*(P) = V*}, where c = J,,[W*(p) —V*lg(p)dp. (9) 


The first of these simply states that a price quote should be accepted if 
the expected value (of the objective function) obtainable by accepting 
it is at least as large as that obtainable by rejecting it. The value of V* is 
determined by (9); we assume that this is positive (otherwise it 1s not 
optimal for the consumer to initiate search for the good). 

Examination of (9) shows that the form of the set A* depends 
crucially on the behavior of the function W*. The familiar case is 
when W* is a monotonically decreasing function (1.e., when higher 
prices unambiguously make the consumer worse off); in this case, as 
can be seen from (9), the set A* is given by {P; P = P*}, where P* is 
the familiar reservation price. However, in our model there is nothing 
to guarantee that W* is monotonically decreasing; indeed, W* may 
take a variety of different forms, depending on the relationship be- 
tween quality and price in the joint distribution. In the extreme, it is 
possible (although unlikely) that quality increases so fast with price 
that the function W* is monotonically increasing; in such a case the 
acceptance set ?* would take the form {P; P = P*} where P* is a 
lower reservation price—a buy-expensive policy would be optimal. 
The general case (one in which W* has both increasing and decreas- 
ing segments) is illustrated in figure 1. 

As the form of the function W* is crucial to the consumer’s optimal 
behavior, we now proceed to an investigation of its properties. From 
our story of consumer behavior, we have 


Wadi 0s adn) — [pa =p")! ( leap) +f aq Pa (10) 


+ iy le f(q | p)dq +f tq Oa 


. ervey” fa | p)dq + a lq — pA — pe) | p)aq 
(where go = 0). 


® Note that the search process is assumed to be instantaneous and thus the expected 
values following the search are not discounted; see also n. 2 above. Note further that if 
cee then (7) yields V(¢) = —c + V(b), which (if ¢ > 0) implies that V(#) is minus 
infinity. 
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w*(p) 


Ve 





CONTINUE CONTINUE 


Fic. 1.—Derivation of the set ?* in the general case 


The optimal reservation qualities at the end of periods 1 to (n — 1) 


—the g¥ (¢ = 1,..., — 1)—are found by setting OW/dq; = 0. It is 
easy to show that this implies that 

gg = V*(1 — p) @= 1, ey ne a1). (11) 
Two things of interest flow from this: First, gj = . .. = g#1, which 


means that if the good’s quality is sufficiently high for it not to be 
thrown away at the end of the first period of its lifetime, it should be 
kept for the remainder of its lifetime. (This result is a consequence of 
our assumed constancy of the per period quality rate; if, in contrast, 
quality were to decline as the good got older, this strong result would 
no longer hold.) Second, ¢§ = V*(1 — p), which means that a good 
should be thrown away (before the end of its useful life) if and only if 
its quality is lower than the per period value obtainable from starting 
search afresh. 

The optimal reservation quality at the end of the good’s lifetime is 
found by setting OW/dq, = 0; this implies that 


qa = V*(1 — p) +p, ve) 


which states that the consumer should repurchase the good if and only 
if its quality net of price is at least as large as the expected value 
obtainable from starting search afresh. Both this, and the earlier 
result (11), accord with intuition. 

If we now substitute (11) and (12) into (10), we get an expression for 
W*; after some simplification, this yields 


We(p) = pV* — [pd — pl - py + J 1 -F(q| pydg 
+(e ea pyf 1 -F@|pydg 3) 


+ to" ~ ey J 1 - Fg] pyldy 
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(where, in view of the equality of the g# forz = 1,...,n — 1, we simply 
work with g# and q*). 

Equations (9), (11), (12), and (13) solve for V*, W*(-), q7, qn amide = 
Imterins ofc. O11: | -), and g(-); they provide the complete solution 
to our model. 

The form of (13) can now be investigated; from it and (12) we have 


Ole 
W*"(p) = —(1 — p")/(1 — p) -| Swe ay 


eon 
— {ci ~ pic ~ pf, 2 GIP) a 


(14) 


ork 
= tor = pf. Ne, 
— (o"/(1 =~ p)ILl ~ Fgh |p). 


Inspection of this shows that W*’ is unambiguously negative if 
dF (q|p)/Op is positive everywhere; however, this latter condition 
implies that F (- | p) is stochastically decreasing in p—or, in economic 
terms, that quality is stochastically decreasing in price. This is highly 
unlikely in practice. A more realistic case is to assume that quality 
generally increases with price so that F(- | p) is stochastically increas- 
ing in p (i.e., F(t| p:) = F(t| ps) for all t whenever p, < ps; see Hadar 
and Russell [1969]). However, this restriction is not sufficient to sign 
W*'. On reflection, this is not a surprising result; one can imagine 
cases where quality increases very strongly with price (so that W*' is 
positive) and, also, cases where quality increases very slowly with price 
(so that W*’ is negative—the direct price effect outweighing the 
indirect quality effect). In general, it is possible that W*’ has both 
positive and negative sections, in which case the stopping rule at stage 
1 would be of a form similar to that shown in figure 1. 

Wilde (1977, 1979) also discusses similar possibilities (see, in par- 
ticular, Wilde [1979], n. 11). Moreover, because his model proceeds at 
a rather higher level of generality than our own (namely, his objective 
function is not simply linear in quality and price), he finds restrictions 
(sufficient to sign his equivalent of our W*’) on the joint distribution 
virtually impossible to obtain. Instead, he simply assumes that the 
distribution is such that the return function (W*) is monotonically 
decreasing. 

However, some further insight can be gained as follows.’ Denote by 
Q, the (random) quality associated with the price p and denote the 


" Although our model assumes that “utility” is linear in price and quality, our results 
would still go through if utility was a separable function of price and quality if the 
distributions f(: ‘) and g(-) were reinterpreted appropriately. 
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distribution function of Q, by F,(-) = F(- | p). Suppose that the 
conditional distributions are such that 


F,(q) = Fo(q — kp), (15) 


that is, Q, is a translation of Qo. (Thus the distribution of Q, is the 
same as that of Q, shifted to the right by a distance kp.) It is a simple 
matter to show that 


AF (q| pylap = —kf(q |p). (16) 
If this is substituted into (14), we find, after some simplification, that 
Wy (he Oil othigs |p) —p) 17) 


— kp(1 — p™)\FQ¢|p)/(1 — p). 


There is an important special case in which the second term in (17) 
disappears; this is when F(qi'|p) = 0, or, in economic terms, when 
there is no possibility of finding a quality so bad that it pays to throw 
the good away before the end of its lifetime.*® In such a case, equation 
(17) implies that 


W*'(p) = 0 according as k 2 1. (18) 


This result makes good sense: It states that the return function is an 
increasing (decreasing) function of price if q — p (the quality net of 
price) is an increasing (decreasing) function of price. Moreover, if g — 
p is independent of price (that is, if k = 1), W* is horizontal (at the 
level V*), and the consumer simply accepts the first price quote re- 
ceived whatever its value. 

The general case is when F (q% | p) > 0, that is, when there exists a 
positive probability of getting a good whose quality is so low that it 
pays to throw it away and start search afresh. In this general case, the 
simple result (18) no longer holds. However, as inspection of (17) 
shows, if k is less than or equal to one, then W*’ is strictly negative. 
Thus, if “net quality” (q — p) decreases with price, the return function 
is monotonically decreasing and a buy-cheap policy remains optimal. 
However, k larger than | is not now sufficient for W*' to be positive, 
because the (nonzero) possibility of getting a “lemon” destroys the 
generally greater attractiveness of higher-price goods.” However, by 
differentiating (17) (and using [16]) it can be shown that & greater 
than 1 is a sufficient condition for W*” to be positive. Furthermore, 
from (17) it can be seen that lim,... W*’(p) = (k — 1)/(1 — p) > 01ifk > 


8 This effectively is the case considered by Wilde (1977). 

9 A sufficient condition for W*’ to be positive is that k is at least as large as (1 — p")/(1 
— p), so that net quality increases sufficiently fast with price to outweigh the nonzero 
lemon possibility. 
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W* (Pp) 


CONTINUE CONTINUE 





(a) 
Fic. 2.—Derivation of A* when (15) holds with k > 1 


1.!° These results imply that if k is greater than 1 (e., if net quality 
increases with price), then W*’ is monotonically increasing and that 
even if it is negative at lower values of p, it must become positive at 
some higher value. Thus, if & is greater than 1, one of the two 
possibilities pictured in figure 2 must exist. 

Case b in figure 2 is particularly interesting, implying that the 
consumer should search until either a very low, or a very high, price 
quote is obtained. Goods at intermediate prices should not be pur- 
chased. This result follows from the combination of net quality in- 
creasing with price and the lemon possibility: High prices are at- 
tractive because the associated high qualities are a sufficiently great 
incentive to outweigh the possible lemon effect; low prices are attrac- 
tive despite the associated low qualities because of the small loss in- 
volved in throwing away a lemon which cost little in the first place; 
intermediate prices are not attractive because they fall between 
these two stools. 

The results discussed in the paragraphs above are, of course, 
specific to the conditional distribution formulation given by (15). 
However, these results do yield some insight into the general con- 
siderations underlying the form of the function W*; clearly important 
is the (stochastic) relationship between net quality (¢ — p) and price. 
Alternative formulations can easily be envisaged. For example, imag- 
ine that the conditional distribution of net quality increases at an in- 
creasing rate with price up to some intermediate price and thereaf- 
ter increases at a decreasing rate. In such a case, W* may be an 
increasing function for low p and a decreasing function for high p; the 
price-acceptance set would then consist of intermediate prices, both 
very low and very high prices being rejected in favor of further 
search. 

Clearly, a whole variety of possibilities exist: Depending on the 


* Since F,(q), as defined in (15), approaches zero as p approaches infinity for any 
finite q. 
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conditional distribution of quality given price, buy cheap, buy expen- 
sive, buy medium, or various combinations of these are all plausible 
buying strategies. Thus “gaps” in the stopping set Y* can be quite 
consistent with optimal behavior. 

Before concluding we note the following comparative static results. 
They can be derived from equations (9), (11), (12), and (13); the 
detailed proofs can be found in Hey and McKenna (1979). 

1. Each of the following makes the consumer-searcher better off in 
that V* and W*(p) increase: (i) a decrease in the search cost c, (ii) a 
decrease in the lifetime of the good n, (iii) a rightward shift of the 
conditional distributions of Q,, (iv) a rightward shift of the marginal 
distributions of P if W* is a monotonically increasing function, (v) a 
leftward shift of the marginal distribution of P if W* is a monotoni- 
cally decreasing function, (vi) an increase in risk (in the Rothschild 
and Stiglitz [1970] sense) in the conditional distributions of Q,. Con- 
sequently the consumer becomes more choosy in that g# and q* in- 
crease and the set P* gets smaller. 

2. The expressions V*(1 — p) and W*(p)(1 — p) increase if p 
increases, that is, if the consumer places greater weight on the future; 
consequently, the consumer gets more choosy in the present in that q 
and q%*, increase and the set Y* gets smaller. 

All of these results accord with intuition, with the possible exception 
of that relating to n. However, when it is remembered that P is the 
price per period of the good’s lifetime, it does make sense to conclude 
that a smaller n is better, in that it reduces the time spent “locked in” 
to an unsatisfactory good. 


IV. Summary and Conclusions 


This paper has presented a unified model of consumer search with 
both price and quality uncertainty. Because price is revealed prior to 
purchase while quality is not revealed until after purchase, the con- 
sumer’s problem is essentially a two-stage problem. The first stage 
relates to the price and involves choosing an acceptance set for prices. 
The second stage relates to the quality, and involves deciding whether 
a particular quality is sufficiently good (a) to retain it after purchase 
and (b) to continue buying it thereafter. The problem is made in- 
teresting and nontrivial by the dependence of quality on price. 
The major result is the finding that the price-acceptance set is not 
necessarily connected and that the conventional reservation price rule 
is optimal only in some circumstances. In the general case, depending 
on the stochastic relationship between quality and price, buy cheap, 
buy expensive, buy medium, or various combinations of these are all 


plausible buying strategies. 
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In the form presented in the paper, the consumer’s utility is as- 
sumed to be linear in price and quality. However, all our results still 
hold if utility is a separable function of price and quality, as long as the 
distributions of price and quality are reinterpreted appropriately (as 
distributions of the utility of price and the utility of quality). Of 
course, in such a generalization optimal behavior would vary from 
consumer to consumer, depending on the differences between their 
utility functions. It would be of interest to explore the implications for 
price/quality equilibria in markets resulting from such behavior. It is 
to be hoped that this paper has laid a firm foundation for such an 
exploration. 
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Starting from an initial (baseline) budget line, income-compensated 
price changes always resulted in substitution effects consistent with 
the Slutsky-Hicks theory. This behavior cannot be explained by a 
simple random-behavior model. Similar changes in relative prices 
that did not originate from the initial (baseline) budget line resulted 
in “undersubstitution effects”: The composition of consumption 
changed in the expected direction, but the magnitude of change was 
not large enough to be consistent with the initial commodity bundle 
chosen. These undersubstitution effects are not explainable by 
shifting preference patterns or anchoring effects found in inconsis- 
tent choice sequences with human subjects. 


This paper reports tests of the negativity of the Slutsky-substitution 
effect in commodity-choice experiments with pigeons as subjects. 
Starting from a given (baseline) budget line, income-compensated 
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increases or decreases in relative commodity prices always resulted in 
substitution effects consistent with the Slutsky-Hicks theory. These 
results, which replicate earlier studies using rats as subjects (Kagel et 
al. 1975, 1980), are not explainable by simple random-behavior mod- 
els proposed in the literature (Becker 1962; Chant 1963). 

Similar changes in relative prices that did not originate from the 
baseline resulted in “undersubstitution effects”; that is, the composi- 
tion of consumption tended to change in the expected direction, but 
the changes were not large enough to be consistent with the original 
(baseline) commodity bundle consumed. These undersubstitution ef- 
fects are not explainable in terms of shifting preference patterns or 
anchoring effects (Tversky and Kahneman 1974) found in inconsis- 
tent choice sequences with humans as subjects. Further experiments 
will be required to identify the basis for this behavior. 

Data on learning patterns as subjects adjust to changes in experi- 
mental conditions are also reported. Questions of interest here con- 
cern identifying uniformities in both the speed of the adjustment 
process and its pattern, whether it can be characterized by concepts of 
habit formation or stock-adjustment processes (Houthakker and 
Taylor 1970). The nature of the learning process is, of course, strictly 
an empirical matter in terms of the static Slutsky-Hicks theory. 

Space considerations do not permit a detailed discussion of the 
rationale behind using nonhuman subjects to investigate economic 
theories (see Kagel and Battalio [1980] for this argument). For the 
more skeptical reader we note that if one defines economics as “. . . the 
study of the allocation of scarce resources among unlimited and 
competing uses” (Rees 1968), then animal psychologists, ecologists, 
and biologists have been involved in studying economic behavior for 
some time now (Rapport and Turner 1977; Hirshleifer 1978; Rachlin 
1980). It is but a small step to take the technologies of these related 
disciplines and apply them to behavior of interest to economists, for 
example, commodity-choice behavior. At a minimum, such studies 
expand considerably the scope for comparative economic analysis. At 
a maximum they provide a laboratory for identifying, testing, and 
better understanding general laws of economic behavior. Use of this 
laboratory is predicated on the fact that behavior as well as structure 
vary continuously across species, and that principles of economic 
behavior would be unique among behavioral principles if they did not 
apply, with some variation, of course, to the behavior of nonhumans. 

The plan of the paper is as follows: Section I describes the hypothe- 
ses tested and the procedures used to operationalize the budget con- 
straint. Sections II and III report the two sets of experiments con- 
ducted: Both comparative static and temporal patterns of behavior 
are examined. We end with a discussion which summarizes the test 
outcomes and their implications for future research and returns to 
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the question of the relevance of the experiments for understanding 
human economic activity. 


I. Hypotheses Tested and Experimental Procedures Employed 


A necessary condition that all members of the class of Slutsky-Hicks 
demand functions must satisfy is that for income-compensated price 
changes (of the Slutsky type) the consumer purchase a commodity 
bundle that was previously unattainable. More precisely, 


if p' A p’ and p’x' = p’x’ then (pi — p*)(! — x*) < 0, (1) 


where p' and x' represent vectors of prices and quantities of goods 
purchased in period 2. Condition (1), also referred to as the weak 
axiom of revealed preference, provides the basis for a direct consis- 
tency test of the theory (Hicks 1956). 

Condition (1) was tested by having food- and water-deprived pi- 
geons choose between the consumption of these two commodities. 
The subjects were placed in an experimental chamber for a fixed 
period of time each day. Each chamber contained a food hopper and 
water reservoir located on the front panel. Access to the food and 
water was controlled by separate variable time schedules. Under a 
variable time schedule commodities become available for a fixed 
period of time every @ seconds on the average, where a@ is determined 
as part of the experimental design. Only one of the variable time 
schedules was in operation at a given time, and subjects could switch 
between the food and water schedule by a single peck on a control 
key. Pecks on this key were not required to continue food or water 
deliveries, but only to change which commodity would be delivered. 
Color-coded lights were used to signal whether the food or water 
schedule was in effect. The price of each commodity was varied by 
altering the average time between deliveries of that commodity (access 
times remaining constant), while the income constraint was the total 
time available for the delivery of the two goods. 

Under these procedures time spent under the food and water 
schedules (Tp, Tw) produces commodities at an average rate of 

1 1 

Gie= Silly, Cpe mee oy, (2) 

Oy ay 
where a@p, @y are the average values of the variable time schedules, 
and C, and Cy are food and water consumption measured in number 
of payoffs. Since time required to switch between schedules is 
sufficiently small that it can be ignored for all practical purposes, 
substituting (2) into the total time (77) constraint 


Tr == Ty at Tp (3) 
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gives an expected value for the budget constraint of 
ApCp Siz ay w — Tr. (4) 


From (4) it is clear that increasing (decreasing) the average time 
interval between deliveries of a commodity has the same impact as a 
price increase (decrease) in the usual specification of the budget 
constraint. Analogously, increasing (decreasing) the total time availa- 
ble for delivery of the goods increases (decreases) nominal income 
(see Becker [1971] for the comparative statics of this problem). 

To establish conditions for testing the negativity of the Slutsky- 
substitution effect, consumption patterns were first recorded under a 
set of initial (baseline) price (a, ajy) and income (T7) conditions. The 
price of one of the commodities was then increased, accompanied by a 
corresponding decrease in the price of the other commodity so as to 
achieve, as nearly as possible, es compensation 


abr + ayCly = a-Ch + ayCiy, (5) 


where the superscripts refer to the initial (¢) and price-change period 
(7). Since commodity consumption varied somewhat from day to day 
under constant experimental conditions, the median bundle con- 
sumed during the last 5 days of a condition served as the reference 
point for compensation. Total time available for consumption re- 
mained constant across experimental conditions, as did the amounts 
of food and water delivered per payoff. 

Consistent choice behavior (a negative Slutsky-substitution effect) 
requires that 

Tr — (aC + aC) < 0. (6) 
That is, with the compensated change in relative prices the consumer 
purchases a commodity bundle in the price-change period that it 
could not afford to purchase during baseline. Since, as already noted, 
consumption was not the same from day to day under constant ex- 
perimental conditions, t-tests applied to the mean value of (6) calcu- 
lated over the last 10 days of the price-change period were used to 
determine consistency. 

In cases where (6) is negative, a simple random-behavior process 
could explain the results (Becker 1962; Chant 1963). For example, if 
the pigeons randomly choose the amount of time spent receiving each 
of the commodities, and total time remains constant across experi- 
mental conditions, then reducing the time between food deliveries, 
which corresponds to a reduction of food costs, will result in in- 
creased food consumption. Correspondingly, increasing the time 
between water payoffs, which increases its price, would result in a 
decrease in average water consumption. Under random behavior the 
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expected value of time spent receiving each of the commodities 
would be 


aC, a VV 
Tanita HEP PG 3 221? 


(7) 


where q; is Constant across experimental conditions.’ 

Chant has characterized consumers who behave in this fashion as 
“money deciders.” In the case where q; = .5, the pigeons would be 
spending an equal amount of time collecting each of the commodities. 
In Chant’s terminology they would be “impulsive money deciders.” 
For the more general case, where the pigeons spend a constant but 
unequal proportion of time collecting each of the commodities, they 
would be characterized as “proportionately inert money deciders.” In 
cases where (6) is negative, we will test whether behavior is 
significantly different from (7) using t-tests for paired comparisons 
and appropriate analysis-of-variance procedures for comparing across 
several conditions. Rejection of (7) suggests that something other than 
a simple random behavioral process is guiding choices. 


II. Experiment 1 
Procedures 


Four male White Carneaux pigeons with no previous experimental 
history served as subjects (S’s). Subjects were completely dependent 
on food and water consumed during experimental sessions with no 
access to these or any other commodities in their home cages. Ex- 
perimental sessions were run once a day 7 days a week starting at 
approximately the same time each day. Consequently, subjects’ body 
weight varied somewhat between conditions depending on consump- 
tion patterns. 

All S’s had extensive training with the procedures prior to the start 
of the experiment. Experimental conditions were changed when in- 
spection of graphs of the data indicated an absence of any significant 
drift in consumption patterns, although at times conditions were 
maintained for considerably longer periods simply to see what would 
happen. 

Food payoffs consisted of 8 seconds access to a food hopper con- 
taining mixed pigeon grains. Fluid payoffs consisted of 3 seconds 
access to a reservoir containing tap water. Access to the food hopper 


' Negative values for (6) rule out the cases of “inert” or “impulsive goods deciders” 
(Becker 1962; Chant 1963). Of course, behavior in accordance with (7) is observation- 
ally indistinguishable from behavior being motivated by a Cobb-Douglas utility function 


(Becker 1962). 


WA JOURNAL OF POLITICAL ECONOMY 


and water reservoir was blocked at all other times. Time spent eating 
and drinking did not count against total time available for delivery of 
commodities (T7), which was 60 minutes for all S’s.? 

For three of the four S’s, food and water were delivered on the 
average at 60-second intervals during the initial (baseline) period. 
This was followed by an increase in the relative price of food. Baseline 
conditions were then reinstated, followed by a decrease in relative 
food prices, which was followed by a return to baseline conditions. 
(Table 1 shows the sequence and magnitude of relative price 
changes.) To account for any shifts in preference structure, the me- 
dian commodity bundle taken in the last 5 days of each baseline 
served as the reference point for the Slutsky-compensated price 
change of the following period. A fourth subject, S 85, started with 
food payoffs coming twice as often as water deliveries (relative price 
of food equal to .5), followed by a Slutsky-compensated increase in the 
price of food. Baseline conditions faced by the other birds were 
implemented in period 3 for this subject. Since the mean commodity 
bundle purchased during period 3 could very nearly have been 
bought during the previous period, we compare these two points also. 


Comparative Static Results 


The effects of the Slutsky-compensated price changes are reported in 
table 1. The results of the increase in food price relative to baseline 
conditions are reported between conditions | and 2 for S’s 14, 46, and 
50 and between conditions 2 and 3 for S 85. In all cases the weak- 
axiom test statistic (6) has the correct sign and is statistically significant 
at the .05 level for three of the four S’s. Price elasticities for water 
ranged from a low of —.07 for S 50 to a high of —.81 for S 14, 
averaging —.38 across all four birds. These price elasticities are a bit 
larger than the highest values reported for food-fluid experiments 
with rats (Kagel et al. 1980). The increase in the relative price of food 
for S 85 between periods | and 2 also resulted in substitution effects 
in the expected direction. 

Decreases in the relative price of food for S’s 14, 46, and 50 in 
period 4 resulted in a negative (correct) sign for the weak-axiom test 
statistic, which was significant at the .05 level or better in all cases. 
Price elasticities here ranged from a low of —.10 for S 50 toa high of 
—.24 for S14, averaging —.16 across S’s. 

Replication of baseline conditions following the price-change 


* Initial values were determined so that S’s could maintain healthy body weights from 
food and water consumed within the sessions. Values were determined on the basis of 


past experiments and experience during the training periods. Additional procedural 
details may be found in the Appendix. 


TABLE 1 


RESULTS OF CONSISTENCY TESTS: EXPERIMENT | 








F-TEST FOR 
EQUALITY OF 


MEAN Payorrs? BASELINES 

SUBJECT AND WEAK-AXIOM (Sy) (Marginal 

EXPERIMENTAL TEST ee Significance 
CONDITION arlay (t-Statistic)@ Food Water Level) 








Subject 14: 
1 


1.0 46.4 13.6 
S fe (1.54) (1-54) 
2 2.88 : 39.5 31.1 
(2.20) (6.55) 
3 1.0 oe 37.1 23.3 3.36%* 
(3.05) (2.92) (.05) 
4 28 C2 48.7 165 
(2.21) (.82) 
5 1.0 42.4 18.2 
Gesy) Cae) 
Subject 46: 
1 1.0 34.2 25.8 
e Bae (1.47) (1.47) 
2 2.86 32.8 35.5 
(1.63) (4.65) 
3 1.0 37.2 29.9 94 
988 (1.58) (1.57) (.40) 
4 22 (2.79)** 43.7 21.1 
(2.08) (41) 
g 1.0 36.0 23.4 
(ele 1na.64) 
Subject 50: 
1 1.0 411 19.1 
Cae (1.62) « ~ (1.55) 
2 4.0 : 40.0 20.8 
(.98) (3.66) 
3 1.0 oe 42.6 v7.7 2.39 
(.82) (.92) (11) 
4 32 ee 47.2 15.6 
(1.58) (.50) 
: 1.0 44.7 16.0 
(1.04) (94) 
Subject 85: 
1 S 33.5 33.4 
‘ eg (1.59) -(.82) 
2 4.5 31.3 42.0 Ah 
~798 (1.33) (4.74) 
3 1.0 (3.88) #** 31.4 28.2 
(3.28) (3.31) 





Note.—a;-/ay = relative food cost; N.A. = not applicable; Sy = standard error of the mean. 
2 The t-tests are two tailed with 9 degrees of freedom. 
>» Calculated over the last 10 days of each period. 
* Significant at the 10 percent level. 
** Significant at the 5 percent level. 
*** Significant at the 1 percent level. 
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TABLE 2 


Test oF SIMPLE RANDOM-BEHAVIOR MODEL 











Income 
Spent Relative F-Test for Equal _ 

Subject and on Food Experimental Price Percentage Hypothesis 
Grouping* (%) Condition of Food (Marginal Significance Level) 
Subject 14: 

A 19 2 2.88 

A hed 1 1.00 

A,B m1 5 1.00 ain 

B .62 3 1.00 : 

C 46 4 .28 
Subject 46: 

A a 2 2.86 

B .62 3 1.00 

B 60 5 1.00 32.7 

B 57 1 1.00 (.001) 

C aod) 4 oe 
Subject 50: 

A 89 2 4.00 

B 15 5 1.00 

B Bal 3 1.00 55.4 

B .69 ] 1.00 (.001) 

C 50 4 nO 
Subject 85: 

A .78 2 4.50 

B 52 3 1.00 34.8 

Cc 34 1 50 (.001) 








* Conditions with the same grouping letter have the same percent of total expenditure devoted to food according 
to Duncan’s multiple-range test (a = .05). 
{ F-statistic excluding period 1 equals 14.7. 


periods showed consumption patterns to be reversible, at least di- 
rectionally, in all cases. Preferences were sufficiently stable that for S’s 
46 and 50 there were no significant differences in consumption pat- 
terns between baseline periods. While consumption for S 14 in period 
1 is found to differ significantly from that of periods 3 and 5 (at the 
.05 level using Duncan’s multiple-range test [Winer 1971]), there are 
no differences between periods 3 and 5. We note that the ratio of 
food to water consumption for S 14 in period 1 was the highest value 
found across all S’s in both experiments under baseline prices. 
Table 2 reports tests of the simple random-behavior model (7). In 
all cases F-tests show that we can decisively reject the hypothesis that 
the percentage of total income spent on commodities was the same 
across differing experimental conditions. Using the letters 4, B, and C 
to indicate experimental conditions that can be grouped together as 
having the same percentage of income spent on food (at the .05 
significance level according to a Duncan multiple-range test), we see 
that the price-change periods are responsible for the differences in 
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each case. For S’s 46 and 50 the percent of total income spent on food 
remained constant across baseline periods (conditions 1, 3, and 5) but 
increased significantly in period 2 when relative food prices increased 
and dropped significantly in period 4 when food prices dropped. 
Subjects 14 and 85 show similar response patterns with the exception, 
already noted, that consumption patterns differed between baseline 
periods for S 14. In all cases demand is far too inelastic for consistency 
to be attributed to the fact that S’s spent a constant proportion of their 
time collecting each of the commodities. These pigeons cannot be 
characterized as either “impulsive” or “proportionately inert” money 
deciders. 


Adjustment Patterns 


The comparative static analysis tells us nothing about the learning 
path as subjects adjust to changes in relative prices. To investigate this 
we fit, within a given experimental condition, the following hyperbolic 
function to the commodity whose relative price decreased: 


Crt =at b(1/t), 


Cy: = consumption of commodity k on day t, commodity k 
being the good whose relative price decreased, 


(8) 


~ 


= time in days within a given experimental condition. 


The function (8) builds in the assumption that consumption ap- 
proaches a steady-state value as time passes and there are no further 
changes in experimental conditions. We fit these functions because (1) 
in terms of conducting the experiments, conditions were only 
changed when inspection of the data suggested no significant drift in 
consumption patterns, and (2) graphs of the data suggested that in 
most cases we did not need to use more complicated functions to 
adequately represent the time path. 

The coefficient b of (8) is readily interpretable in terms of a number 
of concepts within the economics and psychology literature. If b is not 
significantly different from zero we have, in effect, instantaneous, or 
at least very rapid, adjustment to the price changes, relative to the 
day-to-day variability in consumption patterns. Fitting (8) to the 
commodity whose relative price decreased under a given experimen- 
tal condition and finding a negative value for b would imply that 
subjects responded gradually to the change in relative prices, in- 
creasing consumption of the now cheaper good with the passage of 
time. Such behavior would be consistent with concepts of habit for- 
mation found useful in describing human consumption for a number 
of nondurable goods (Houthakker and Taylor 1970; Phlips 1974). 
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The presence of habit formation would also be consistent with psy- 
chological notions that it takes S’s time to perceive the changes in 
relative prices and even more time to learn the implications of re- 
sponding to these changes. 

In contrast, a positive sign for 6 in (8) would indicate that the 
immediate response to a decrease in relative prices was to consume 
more of the commodity than with the passage of time. This sort of 
response pattern goes under the heading of a stock-adjustment effect 
within the Houthakker-Taylor framework. It might be present in our 
data for any number of reasons. 

In general there is no a priori basis for deciding whether habit- 
formation or stock-adjustment effects will predominate for a particu- 
lar commodity. For example, Houthakker and Taylor (1970) found 
that housing exhibited strong habit-formation effects, while domestic 
services and water use showed stock-adjustment effects generally con- 
sidered to be characteristic of consumer durable goods. The empirical 
work in psychology shows much the same diversity since learning 
curves of almost every conceivable shape have been reported in the 
literature (Mazur and Hastie 1978). 

Figure | shows the daily water consumption for condition 2 when 
the relative price of water decreased. The mean consumption of water 
during the last 10 days of the previous period is also shown as an open 
circle on the water axis. The estimated b values and the Durbin- 
Watson statistics obtained from the least-squares estimates of equation 
(8) are also given for each subject.*? The data show little trend and a 
great deal of variability. This is evident in the least-squares estimates 
of the time-trend coefficients, b, which are not statistically significant 
for any of the S’s. Note, however, that in each case b is positive in sign, 
so that what little trend there is in the data is inconsistent with the 
notion that habit formation dominates choices. That is, there is a 
tendency to have more substitution into water during the first few days 
than is shown in the comparative static comparisons. 

Figure 2 shows the daily food consumption for condition 4 when the 
relative price of food decreased. Once again there appears to be little 
trend, a result captured in the ¢-test ratios for the least-squares esti- 
mates of b, which are insignificant in all cases, although the signs of 
the coefficients are somewhat more favorable to the habit-formation 
hypothesis. 

The absence of significant time-trend coefficients, in conjunction 
with the data reported in table 1, implies that $’s adjusted very quickly 
to the changes in relative prices, at least within the time frame for 


* The breaks in the sequential data plotted in figs. 1-4 correspond to days when 
the equipment malfunctioned. Dummy variables are used in these cases when adjusting 
(8) to the data. 
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Fic. 1.—Temporal pattern of water consumption following a decrease in its relative 
price: experiment 1, condition 2. Broken lines indicate days with equipment failures. 
Estimated values of the time trend coefficient, b, with the corresponding ¢-statistic 
reported in brackets are given for each subject along with the value of the Durbin- 
Watson statistic. The symbols +, *, and ** indicate statistical significance at the 10, 5, 
and 1 percent levels, respectively. The symbols — and o indicate the Durbin-Watson test 
was indeterminate or there was an insufficient number of observations for the test. 


which we have data (daily totals) and relative to the day-to-day vari- 
ability in the data. For example, in going from condition | to 2, S$ 14 
more than doubled water intake but has a time-trend coefficient with 
a t-statistic considerably less than 1.0. The frequent trips to the 
cafeteria windows (S’s received 60 or more payoffs per session), the 
low search costs involved in choosing within the confines of the ex- 
perimental test chamber, and the physiological limits on substitutabil- 
ity imposed by the commodities employed all facilitate S’s adjusting 
rapidly to the changes in experimental conditions. The data here 
indicate that at least some of the subjects took advantage of these 
conditions. 

The Durbin-Watson statistics, while indicating a tendency to nega- 
tive first-order autocorrelation in the residuals, do not show a serious 
problem in these respects. Such a problem might be anticipated from 
the physiological interdependencies between food and water, which 
would result in the pigeon’s getting caught short on food or water for 
a given day, only to be compensated for in the next day’s consumption 
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Fic. 2.—Temporal pattern of food consumption following a decrease in its relative 
price: experiment 1, condition 4. Notation is the same as in figure 1. 


bundle.* The data indicate, however, that this did not happen. Sub- 
jects either had sufficiently good “planning,” given the frequent trips 
to the cafeteria window in a given day, or sufficiently frequent cross- 
overs between food and water schedules so as not to generate imbal- 
ances strong enough to dominate the residuals. Durbin-Watson 
statistics computed on the basis of fitting (8) to the data from the 
baseline periods show similar results. 


III. Results for Experiment 2 


Procedures 


Essentially the same procedures were employed as in experiment 1. 
The major difference was the sequence of price changes. 

Four experimentally naive male White Carneaux pigeons served as 
subjects. Initial (baseline) conditions were the same as for S’s 14, 46, 
and 50 in experiment | with the exception that § 38 obtained 13 


* We are assuming that $’s can and will correct imbalances within one day. This 
assumption is based on data for days following equipment failures (e.g., a malfunc- 
tioning food dispenser) which show that, for rats and pigeons alike, excluding data for 
the day of the failure and the following day generally leaves the time trend unaffected, 
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seconds access with each food payoff and 6 seconds access to the water 
reservoir. The difference in price-change sequence from experiment 
1 consisted of (1) decreasing relative food prices first and (2) not 
returning directly to baseline conditions but following this with a 
compensated increase in food prices relative to baseline before re- 
turning to baseline conditions. (Table 3 shows the sequence and 
magnitude of price changes.) 


Comparative Static Results 


Table 3 reports the effects of the compensated price changes. The 
decrease in relative food prices in period 2 led to statistically 
significant substitution effects in the expected direction in all cases. 
The price elasticities of demand for food ranged from a low of —.21 
for S 38 to a high of —.51 for S 36, averaging —.32 across S’s. 
Applying t-tests to the percentage of total time spent obtaining food in 
periods 1 and 2, we again reject the hypothesis that the simple 
random-behavior model (7) explains any of the data: Demand is far 
too inelastic in this case also. 

The compensated increase in relative food prices in period 3 does 
show some surprises, however. In all cases the relative price of food 
was greater than during the initial baseline period. However, the 
predicted magnitude of change in consumption did not materialize 
for any of the S’s. Although the composition of consumption (ratio of 
food to water payoffs) changed in the “right” direction from period 2 
to 3 for three of the four S’s, in no case was the change sufficiently 
large to result in a correct sign for the weak-axiom test statistic (6). In 
contrast, comparable increases in relative food prices starting from 
baseline conditions in experiment | resulted in a correct sign for the 
test statistic in all cases. Further, a comparable increase in relative 
food prices for S 85 in period 2 of experiment 1, which started from a 
relative price ratio similar to that faced by S’s 36-39 in period 2 of 
this experiment, did result in consistent substitution effects significant 
at the 10 percent level.’ 

One potential explanation for the behavior observed in period 3 is 
that preferences had shifted. The return to baseline conditions in 
period 4 enables us to test this hypothesis. For the one subject, S 36, 
whose behavior is clearly inconsistent in period 3, preferences have 
clearly shifted (¢ = 5.15, df = 18, P < .001).° For the other three S’s 
showing nonexistent or undersubstitution effects, the hypothesis 
receives only weak support. Although food consumption has in- 


» Real income was somewhat higher for S 85 in period 1. 
6 P < .001 = probability that we can reject the null hypothesis of no effect. 


TABLE 3 


RESULTS OF CONSISTENCY TESTS: EXPERIMENT 2 











t-TEST FOR 
MEAN EQUALITY OF 
PAYOFFS” BASELINES 
SUBJECT AND WEAK-AXIOM (Sy) (Marginal 
EXPERIMENTAL ‘TEST Significance 
CONDITION aplay (t-Statistic)# Food Water Level)* 
Subject 36: 
1 1.0 24.1 35.9 
— 1032 (1.26) (1.26) 
(11.3)*** 
2 2D 47.5 29.7 
546 (1.89) (.37) 5.15 *** 
2K 
3 9.02 Gay 27.6 23.3 001) 
(37) (3.15) 
4 1.0 Bee 26.8 
(1.24) (1.24) 
Subject 37: 
1 1.0 41.2 18.8 
~312 (47). (147) 
(4.24) *** 
2 50 318 49.6 15.6 
(1.88) (2.46) E27) 93 
3 4.0 : 42.1 12.6 (37) 
(.90) (a7) 
4 1.0 43.8 16.2 
(2.39) (2.39) 
Subject 38: 
1 1.0 45.0 15.1 
— 228 (1.30) (1.35) 
(3.90)*** 
2 53 54 51.2 226 
(1.03) (2.05) (1.09) .78 
3 2.60 45.1 14.0 (.45) 
(67) (1.48) 
4 1.0 47.0 13.0 
(2.21) (2.21) 
Subject 39: 
1 1.0 39.9 20.2 
— 144 (2.00) (2.01) 
(2.51)** 
2 53 47.0 15.4 
0.0 (1.58) (1.06) 1.78* 
3 2.40 (0.0) 43.1 16.9 (.09) 
Cry haze) 
4 1.0 44.1 15.9 
(1.25) (1.25) 





Nore.—a;/ay = relative food cost; Sy = standard error of the mean. 
@ The ¢-tests are two tailed with 9 degrees of freedom. 
» Calculated over the last 10 days of each period. 

© The t-tests are two tailed with 18 degrees of freedom. 


* Significant at the 10 percent level. 
** Significant at the 5 percent level. 
*** Significant at the | percent level. 
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Fic. 3.—Temporal pattern of food consumption following a decrease in its relative 
price: experiment 2, condition 2. Notation is the same as in figure 1. 


creased in period 4 relative to the initial period for all three, in no case 
does the increase seem to be sufficient, by itself, to explain the 
undersubstitution effects. The magnitude of the change in relative 
prices, and the fact that we are dealing with Slutsky-compensated 
price changes which, if anything, overestimate the Hicksian substitu- 
tion effect (Friedman 1976), would seem to more than compensate 
for any small shifts in preference structure that might have occurred.” 
Finally, the behavior reported in period 3 is inconsistent with the 
random-response model (7) also. 


Adjustment Patterns 


Figure 3 shows food consumption in period 2 following the decrease 
in its relative price. The data show more of a positive time trend, 
consistent with the notion of habit formation, than comparable obser- 
vations from experiment 1. Estimated values of the time-trend 
coefficient, b, from equation (8) confirm this as we have negative 
coefficients in three of the four cases, each one of which is significant 
at the 5 percent level. These negative coefficients indicate that sub- 
jects responded slowly to the decrease in relative food prices, consis- 
tent with notions of habit formation. 


7 We are assuming here that both food and water are normal goods. 
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Fic. 4.—Temporal pattern of water consumption following a decrease in its relative 
price: experiment 2, condition 3. Notation is the same as in figure 1. 


In contrast, the data for period 3, shown in figure 4, during which 
the relative price of water decreased, show stock-adjustment effects in 
all cases, being statistically significant at the 1 percent level for S 37 
and at the 10 percent level for S$ 38. Thus, the initial response to the 
decrease in the price of water was greater at the beginning of this 
period than at the end. In fact, for all S’s the commodity bundle 
purchased on the first day of period 3 satisfied the basic consistency test 
(1). Further, for S’s 37-39, who did not shift preferences between 
baseline periods, the weak-axiom test statistic calculated over the first 
3 days of period 3 has the correct sign, although it is somewhat 
smaller in magnitude than values observed under comparable condi- 
tions (period 2) in experiment 1. These data would appear to rule out 
arguments that the undersubstitution effects found in period 3 were a 
result of not maintaining experimental conditions for a sufficiently 
long period of time and/or that S’s failed to correctly process the 
magnitude of the changes in relative prices. 

The adjustment patterns in the two experiments are qualitatively 
similar. Pooling the results from the two experiments shows an 
asymmetric response to increases and decreases in relative food 
prices. For decreases in prices we find a tendency to habit formation, 
or increased food consumption with the passage of time. For increases 
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in relative food prices the data consistently show anti—habit forma- 
tion, more water (less food) consumed immediately following the 
price change than later on. These conclusions are reached whether we 
combine the t-test outcomes for the price-change periods (Z = —2.28, 
P < .03 for the t-values associated with b in periods when relative food 
prices decreased; Z = 8.0, P < .001 for the t-values associated with b in 
periods when relative water prices decreased) or use a x? analysis 
based on the signs of the time-trend coefficients (7 = 9.6,df=1,P< 
cE) 

The asymmetric response to increases and decreases in relative 
food prices is quite unexpected. One possible explanation lies in 
physiological asymmetries with respect to how animals process food 
and water. Food can be stored in the form of enhanced body weight, 
while the experimental procedures do not permit maintaining com- 
parable stocks of water. Thus, in the case of increased food prices, 
there is a stock of food available to buffer large and immediate 
increases in water consumption. But with decreases in the price of 
food, there is not a stock of water to produce similar results. Further, 
food-buffer stocks were larger in experiment 2 at the time food prices 
increased since it came immediately after a large decrease in food 
prices in the previous period. This might have contributed to the 
stronger stock-adjustment effects found in this experiment. 

Although adjustment patterns in the two experiments are qualita- 
tively similar, a question arises as to what factors are responsible for 
the quantitative differences in the ¢-ratios and the size of the time- 
trend coefficients between the two experiments. The size of the re- 
ductions in relative food prices during condition 2 of experiment 2 is 
comparable to that faced by subjects in experiment | (see tables 1 and 
3). Further, the length of the baseline periods preceding these price 
changes was similar in the two experiments. This would seem to rule 
out arguments that these quantitative differences result from the 


* Under the hypothesis that the mean value for the t-statistic in the population is zero, 
the statistic, 


Dt; 
SiG 2 


where f; represents the degrees of freedom associated with ¢;, has a sampling distribu- 
tion which is approximately N(0, 1) (Winer 1971). The picture gets a bit fuzzier if we 
include periods of return to baseline (conditions 3 and 5 in experiment 1, condition 4 in 
experiment 2) to the analysis. The Z-statistic for t-values when relative food prices 
decreased loses significance (Z = — 1.12, P = .26), while for periods when relative water 
prices decreased the significance level drops (Z = 2.60, P < .01). Also, the significance 
level for the x? test drops (x? = 4.5, df = 1, P < .04). The primary reason for this is that 
the sign of the b coefficient associated with food consumption in period 4 of experiment 
1 is positive for S’s 37-39, while to be more consistent with the price change periods it 
should be negative since food prices decreased going into this period. 


a 
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magnitude of price changes or the length of time prices had remained 
stable. This leaves individual subject differences or differences in 
procedures (sequence effects or some undetected change) between 
the two experiments as the most likely factors responsible for the 
differences. 

Attributing part of the differences to variability in individual sub- 
ject responses receives some support. The learning-curve literature in 
psychology shows a lot of variability in response patterns using 
single-subject data, with some individual learning curves being very 
irregular or varying greatly from the general form for a given task 
(Mazur and Hastie 1978). Examination of the magnitude of the esti- 
mates of ) within each experiment reveals a similar variability in 
behavior. For example, in both experiments one of the four subjects 
has a positive point estimate when the relative price of food de- 
creased. Similar differences in individual subject responses are, of 
course, not detectable in aggregate per capita data such as Houthak- 
ker and Taylor employed. However, the consistent negative signs of 
the time-trend coefficients for decreases in relative food prices and 
positive signs for increases in relative food prices (decreases in the 
relative price of water) suggest that an aggregate per capita analysis, 
under conditions where all subjects faced the same relative price 
changes, would clearly exhibit these patterns. 

Finally we note that the values for the Durbin-Watson statistics 
reported in figures 3 and 4 are comparable to those found in experi- 
ment 1. While there is a tendency to negative first-order autocorrela- 
tion in the residuals, these effects are rarely strong. Even though S’s 
would readily consume more at the end of a session if given the 
opportunity, they managed to keep food and fluid intake relatively 
well balanced on a daily basis. 


IV. Summary and Discussion 


Using figure 5 we can readily summarize the consistency test results. 
Starting from some initial (baseline) budget line (line 1), with con- 
sumption bundle 4, Slutsky-compensated increases or decreases in 
relative prices result in consistent and statistically significant substitu- 
tion effects (points B and C, respectively). This result replicates similar 
tests of the Slutsky-Hicks theory using rats pressing levers and 
choosing between food and water, or flavored fluids with ad lib. 
access to food and water (Kagel et al. 1975, 1980). These results 
suggest the universality and robustness of one of the fundamental 
“truths” of the static theory of consumer demand, that compensated 
price increases will bring about a decrease in quantity demanded. 


COMMODITY-CHOICE BEHAVIOR 85 


Food 
Payoffs 





Water Payoffs 


Fic. 5.—Summary of consistency-test results in terms of commodity-choice space. 
Numbered lines are budget lines under differing relative prices with capital letters 
characterizing choices. See text for details. 


Further, this truth does not rest on simple random-behavior models 
suggested by Becker (1962) and Chant (1963) but apparently reflects 
a basic behavioral principle. 

Large rotations in the budget line which involved swinging from 
line 2 to 3 without stopping off at baseline resulted in inconsistent 
choice behavior (point C’) relative to the initial commodity bundle 
consumed. While S’s tended to alter their composition of consump- 
tion in the expected direction, they did not change enough to be 
consistent with the initial commodity bundle consumed.’ Efforts to 
explain these undersubstitution effects in terms of shifting preference 
structures and/or delayed reactions (learning) in response to the 
change in relative prices did not prove successful. 

Although it would be nice to have an explanation for the inconsis- 


° We searched our rat data for similar rotations in the budget line. In the one case this 
happened the results are similar to those reported here. This was overlooked at the 
time since further rotation of the budget line in the same direction resulted in addi- 
tional changes in consumption so that the weak-axiom test statistic was satisfied. 
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tencies reported here in terms of established patterns of behavior 
derived from the choice experiment literature, we have not found one 
which is completely consistent. For example, anchoring effects, the 
fact that for humans estimates of quantities (Tversky and Kahneman 
1974) and strengths of sensory phenomena (Stevens 1975) are greatly 
affected by the starting point used in making these estimates, might 
be used to explain the differences in going from budget line 2 to 3 
directly rather than from | to 3. But anchoring, unless conjoined with 
a hypothesis specifying some minimum difference between points 
before it takes effect, would also seem to imply systematic biases in 
consumption patterns in returning to baseline conditions following a 
relative price change, something we did not observe in experiment 1. 
Explaining the inconsistencies in terms of changes in the animal's 
body weight resulting from the relative price changes runs into simi- 
lar problems, although we cannot rule out more complex physiologi- 
cal interaction effects. Perhaps the behavior simply results from S’s 
inexperience, so that once having experienced a compensated price 
change starting from baseline prices, the same commodity bundle 
would be chosen starting from elsewhere in the choice space. 
Identification of the factors underlying the inconsistencies must wait 
further experimentation. Identifying these factors would also help 
clarify the behavioral mechanism underlying consistent choices. 

Looking at day-to-day adjustments in consumption patterns result- 
ing from changes in relative prices we find a number of cases in which 
adjustments were completed within the first observation period and 
other cases with statistically significant learning effects. Individual 
subject differences rather than the size of the price changes or other 
environmental factors were cited as one of the factors behind these 
differences. The fact that the adjustment coefficients tended to have 
the same sign whether they were statistically significant or not sug- 
gests that an aggregate analysis under conditions where all subjects 
faced the same price changes would produce statistically significant 
learning effects. Unanticipated asymmetries in the adjustment path 
resulting from increases and decreases in relative food prices were 
also found. We are looking at adjustment patterns from the rat 
studies in an effort to confirm these regularities. 

Questions naturally arise concerning the relevance of the animal 
behavior reported here to understanding analogous human behavior, 
the behavior of primary interest to most economists. Objections 
might be raised on two grounds. One is that there are essential 
differences between the choice conditions (environmental conditions) 
the pigeons face and conditions facing human consumers operating 
in national economic systems. The pigeons are not given money (or 
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tokens, as in some human economic experiments) and told what 
prices are, but are simply faced with sequences of situations from 
which they must deduce what the rules of the game are. One might 
argue that this results in the pigeons facing a more difficult situation 
than is assumed in ordinary consumer-demand theory. Second, one 
might object that there are essential differences in the choice process 
itself, on what goes on inside the pigeon’s brain, so to speak, relative to 
human consumers. The sequential way in which subjects learn about 
changes in relative prices and the apparent sequential nature of 
choices even under steady-state conditions stand in marked contrast 
to the textbook characterization of a consumer who maximizes a 
utility function subject to a budget constraint at a point in time, 
apparently deciding on all purchases at once. We deal with these 
objections below. 

There is no question that the choice conditions the pigeons faced 
here have no exact physical analogue to choice conditions outside the 
laboratory for humans, or for pigeons in the wild for that matter. 
What an experiment aims for is to develop procedures in the labora- 
tory which provide a logically consistent interpretation for the con- 
cepts and relationships of the theory under investigation. Within the 
context of consumer-demand theory, this involves developing proce- 
dures that alter the choice set in the same way that the symbols 
commonly designated “price” and “income” alter choices in the 
general theory. This is true of any application of the theory, be it 
experimental or nonexperimental, using human or nonhuman sub- 
jects, so that there are always some differences in environmental 
conditions in different applications of the theory, and these differ- 
ences may affect decision outcomes. 

If the theory is to have any generality it must hold under a variety 
of such choice conditions. Consequently, confirmations and dis- 
confirmations are relevant to evaluating the truth status of the general 
theory under any and all logically consistent interpretations:'® 
Further, it is often the case that the most preferred conditions for 
testing a theory will be “atypical.” For example, one of the crucial 
experiments of relativity theory is to determine if stars behind the 
edge of the sun appear to be displaced from their known positions 
during a total solar eclipse. One does not object to such experimental 
observations simply on the grounds that they do not correspond to 
“normal” conditions (Smith 1980). Similarly, perfect compensation, 


‘© Identifying differences in behavior under different interpretations of the general 
theory sets off a search for the factors responsible for these differences and eventually 
results in modifications of the general theory to account for these factors. Unfortu- 
nately, this is often an exceedingly slow process. 
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which is required to directly apply the weak-axiom test (1), rarely 
occurs in nature but has an important role to play in better under- 
standing commodity-choice behavior. Establishing procedures for 
implementing these conditions using nonhumans is considerably less 
costly than with humans, while introducing tokens or some other 
conditioned reinforcer to serve as a medium of exchange and unit 
accounting, although technically feasible, would only complicate the 
experiment with little likelihood of seriously affecting behavior. In 
fact, we might argue that having subjects learn the rules of the game 
through experience rather than relying on price representations 
eliminates potential pyschological complications introduced by such 
representations in monetized economies (Russo, Krieser, and 
Miyashita 1975), proving a reference point against which to evaluate 
the predictions of the theory. 

The apparent sequential nature of the pigeons’ choices, while 
standing in contrast to notions of consumers possessing full informa- 
tion and making decisions simultaneously, poses no conflict with re- 
spect to observable characteristics of human choice in the mar- 
ketplace. It takes real time for consumption to occur; hence, real 
choices of all organisms have a sequential aspect that is commonly 
ignored in comparative static analysis. Further, differences in “plan- 
ning horizons” between humans and pigeons must, almost by defini- 
tion, be one of length rather than existence, unless one is willing to 
argue that the pigeons’ behavior is the result of some elaborate, 
genetically determined program. 

Finally, it must be remembered that whether or not pigeons or 
humans have consciously thought out their behavior is irrelevant to 
characterizing that behavior as a solution to a constrained optimiza- 
tion problem. As Samuelson (1947) notes, “... itis possible to formulate 
our conditions of equilibrium as those of an extremum problem, even 
though it is admittedly not a case of an individual’s behaving in a 
maximizing manner, just as it is often possible in classical dynamics to 
express the path of a particle as one which maximizes (minimizes) 
some quantity despite the fact that the particle is obviously not acting 
consciously or purposively.” In a competitive world all organisms 
must solve the problem that time and energy are limited and that 
efficient allocation of resources, over which the individual has control, 
is essential to success. The fact that man is capable of self-awareness 
and reasoning and is occasionally guided by it in solving mathematical 
puzzles, for example, does not imply that these factors are either 
necessary or sufficient to achieving efficient allocation of resources. 
Nor does this prove that human consumers in the marketplace are 
guided by these factors, while pigeons in the laboratory are not. 
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Appendix 
Procedural Details 


The experiment was conducted in a sound-insulated pigeon test chamber that 
measured 13 4 inches long, 11 % inches wide, and 14 inches high. The test 











TABLE Al 
No. of 
Subject and Days in 
Experimental Condition* Condition Qp ay 
Subject 14: 
1 5 60 60 
2 12 ri 25 
3 46 60 60 
4 16 34 120 
5 14 60 60 
Subject 36: 
1 20 60 60 
2 19 20 90 
3 16 120 13.3 
4 56 60 60 
Subject 37: 
1 46 60 60 
2 19 45 90 
3 20 80 20 
4 27 60 60 
Subject 38: 
] 18 60 60 
2 20 48 90 
3 15 72 Died 
4 48 60 60 
Subject 39: 
1 49 60 60 
g 17 48 90 
3 15 2 30 
4 31 60 60 
Subject 46: 
l 14 60 60 
2 12 80 28 
3 39 60 60 
4 25 26 120 
5 28 60 60 
Subject 50: 
1 8 60 60 
2 11 80 20 
3 50 60 60 
4 19 38 120 
5 ne 60 60 
Subject 85: , 
l 16 36 72 
2 22 90 20 
3 33 60 60 








Note.—a, = average time between food payoffs in seconds; ay = average time between water payoffs in seconds. 
* Mean value reported in the text for S’s 14 and 50 in condition 1 are over all days in this condition. 
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chamber had three response keys (2.5 centimeters in diameter, fabricated 
from clear acrylic plastic) located on the front panel where the food and water 
reservoirs were located. A single peck on the center response key switched 
modes of delivery from food to water and vice versa with no changeover 
delay. This key was transluminated with white light at all times. The other two 
response keys, located over the food hopper and water reservoirs, were color 
coded to indicate the delivery mode in operation. Responses on these keys 
did not affect the contingencies. General chamber illumination was provided 
by a 7-watt houselight. When commodities were delivered, the response keys 
were darkened but the houselight remained on. A fan provided ventilation, 
and constant white masking noise was provided. 

Food and water deliveries were programmed by separate variable time 
tapes. The tapes established a random sequence of interreinforcement times 
following an exponential distribution under all conditions. The sequence of 
interreinforcement times repeated itself every 20-30 reinforcements. 

Food reinforcements occurred only when in the food mode and consisted 
of the food hopper coming into reach and the food hopper lights coming on 
during reinforcement. Water reinforcement occurred only when in the water 
mode and consisted of pumping water into a reservoir at the rate of | 
milliliter per 15 seconds and operating a response feedback clicker every 14 
second. During reinforcement, pecks on the center response key did not 
result in a mode switch. 

Table Al shows the number of days each condition was in effect and the 
average absolute time delays (in seconds) between food and water payoffs. 
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Taste Change in the United Kingdom, 
1900-1955 





Steven E. Landsburg 


University of Chicago 


A search is made for violations of the axiom of revealed preference 
in aggregate consumption data for the United Kingdom. No such 
violations are found. This is taken as evidence that tastes remained 
constant throughout the period under study. The strength of this 
evidence is estimated within the context of a particular model of 
taste change. 


I. Introduction 


This paper investigates the phenomenon of “taste change” in the 
United Kingdom in the years 1900-1955. In the paragraphs which 
follow we shall describe a method of searching for evidence of that 
phenomenon. In Section II we shall carry out several versions of that 
search and find no such evidence. In Section III we shall develop a 
general model of taste change in the context of which we shall 
strengthen the negative result of Section II. Section IV is a further 
discussion of the model. In Section V we shall discuss some of the 
implications of our results for economic analysis. 

Our basic methodology is similar to that of Houthakker (1963) and 
Koo (1963). First we choose a set of goods. Then we choose a pair of 
years—say, for concreteness, 1900 and 1910. We observe what quan- 
tity of each good was consumed in 1900 (we shall refer to this set of 
quantities as the 1900 market basket), and we calculate what the price 
of this market basket would have been had it been purchased in 1910. 
If the price is lower than the price of the basket which was actually 
consumed in 1910, then we conclude that (in 1910) the 1910 basket 
was considered preferable to the 1900 basket. We repeat this proce- 
dure for every possible pair of years in some time period. Then we 
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search for intransitivities—for example, the 1910 basket preferred to 
the 1900 basket, the 1900 basket preferred to the 1905 basket, and the 
1905 basket preferred to the 1910 basket. In other words, we search 
for violations of the Strong Axiom of Revealed Preference. Such an 
intransitivity is taken as evidence of a change in tastes. 

These methods can be applied either to total national consumption 
or to per capita consumption. We have done both. 

A problem arises when a good is available only in certain years—for 
example, there appear to have been no raspberries in Britain before 
1922. This problem (which was rare) was solved by declaring the good 
in question to be identical with some other good, the choice of “other 
good” being dictated partly by aesthetic considerations and partly by 
the observation that identical goods should have nearly identical price 
series. (Raspberries were classed with black currants.) 

When a good is included in the market basket, it is important to 
include all close complements and substitutes for that good as well. If, 
for example, our basket consisted only of automobiles and bicycles, we 
might discover a preference for more bicycles and fewer automobiles 
in 1978 than in 1971—reflecting not a taste change but a rise in the 
price of gasoline. The problem would not arise if gasoline were 
included in the basket. 


II. Evidence 


The methods described in Section I were applied to six sets of data, 
which are described below. No intransitivities were observed in any 
case. 

We first considered total national consumption of nine goods: food, 
alcoholic beverages, tobacco, clothing, transport and communication, 
housing, fuel, household durables, and “other.” The study covers the 
years 1900-1955. It is noteworthy that no intransitivities occur 
throughout this period despite the fact that the definition of the 
United Kingdom changed with the secession of Ireland in 1920. This 
may be taken as a small bit of evidence that the tastes of the nation did 
not change even when the composition of the nation did. 

The second study was identical to the first with respect to the time 
period and the goods considered; it reflected per capita rather than 
total consumption. 

The third and fourth studies were identical to the first and second 
with the exception that housing was eliminated from consideration. 
This was because expenditure on housing was defined as the total of 
rents, rates, and water charges collected, a definition which is clearly 
objectionable (e.g., it fails to include the opportunity cost of owner- 
occupied housing). 
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Fic. 1.—Revealed preferences among years for total U.K. consumption of all items, 
classified into nine categories. 


Finally, we broke food, alcohol, and tobacco down into 107 subcate- 
gories and studied data on those 107 goods for the years 1920-38. 
Again this was done both for total and per capita expenditures. 

Figure | provides an example of the results which were obtained. It 
depicts the conclusions of the first study described above, concerning 
total national consumption of nine broadly defined categories of 
goods. An arrow A — B indicates that the consumption of year A was 
revealed preferred to that of year B. Revealed preferences which can 
be inferred by transitivity from the ones depicted have been omitted 
from the figure in the interests of clarity. Analogous charts for the 
other studies described above are available from the author. 

The data for the first four studies mentioned were taken from 
Mitchell and Deane (1954); for the other two they were taken from 
Stone and Rowe (1954). 

It should be noted that the upward trend in income, particularly 
toward the end of the period under consideration, would tend to hide 
changes in taste even if they did occur.’ However, the tests in the next 
section will explicitly take this phenomenon into account. 


III. Interpretation 


Our failure to observe a taste change supports the belief that no taste 
change occurred; now we ask how strongly that belief can be sup- 
ported. Upon making certain assumptions and reexamining the data, 
we shall be able to infer that it is unlikely that the belief is far wrong. 


‘I am indebted to Gary Becker for this observation. 
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While the model which we shall develop could theoretically be 
applied to a market with an arbitrary number of goods, practical 
considerations such as limited computational resources require that 
we apply it only in the two-good case. For that reason, we shall restrict 
our detailed description of the model to the two-good case; the results 
for the n-good case are available on request. 

Let X and Y be two goods with prices py and py. We shall suppose 
that the utility function is the “constant elasticity of substitution” 
function U(X,Y) = C(AX" + BY“)'", where A, B, C, and wu are constants 
and 0 # u < 1. It follows that the elasticity of substitution between X 
and Y is everywhere equal to 1/(1 — uw). 

We have dU/dX = AC (AX" + BY")“ 1X"! and dU/dY = BC(AX™ + 
BY*)—1y“-1. By changing the units in which X and Y are measured 
we may assume that utility is maximized when one unit of each is 
consumed, from which we infer that (dU/dX)/(dU/dY) | aw = pxlpy, 
that is, A/B = py/py. 

We suppose for the moment that tastes change while relative prices 
and real income do not. We shall interpret a change in tastes as a shift 
to a new utility function V (X,Y) which is such that dV/dX = a(dU/dX ) 
and dV/dY = B(dV/dY), where a and B are observations of random 
variables & and f. That is, we assume that at the margin, any unit of X 
or Y will yield @ or B times as much utility as it did before. The 
possibilities a < 6 and B <a should be equally likely, so we assume 
that &/@ has a median of one. The function satisfying these require- 
ments is V (X,Y) = C(aAX" + BBY")'". This has the same indifference 
curves as V(X,Y) = C[(a/B)AX" + BY“}', so we may replace a by a/B 
and B by one. Thus our assumption is that the new utility function has 
the form V (X,Y) = C(aAX" + BY")"", where @ is an observation ofa 
random variable & with a median of one. (The hypothesis that tastes 
do not change is the hypothesis that a has variance zero.) 

Now dV /dX = aAC(AaX" + BY") 1X"! and dV/d¥ = BC(AaX™ + 
BY“ 4y" 7, Indight of the budget constraint, the new levels of 
consumption must be | + ¢ units of X and 1 — (A/B)t units of Y for 
some number ¢. In order to maximize utility, ¢ must satisfy 


dV /dX 


i ee ee 
that is, 
aA 
oe Nee ee eB 
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Fic. 2 


Now let us suppose that in our second period we have not only a 
new utility function but a new budget line as well. If the two budget 
lines fail to cross, then there is no hope of observing a taste change, 
because no point on the inner budget line can ever be revealed 
preferred to any point on the outer one. Thus we restrict our atten- 
tion to the case in which the budget lines do cross, as in figure 2. We 
can rename the goods so that the price of X falls relative to that of Y 
between the first and second periods; thus we may assume that LM 
represents the first-period budget line and JK the second-period 
budget line. 

When the basket consumed in the first period is on segment OM 
and the basket consumed in the second period is on segment OJ, each 
is revealed preferred to the other and a taste change is observed. An 
estimate of the probability of this occurrence would yield a measure of 
the strength of our negative result; however, we shall argue that the 
hypothesis of taste change can be more strongly rejected through 
consideration of the “exact opposite” situation—that in which neither 
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Fic. 3.—If there is an indifference curve tangent to OM, there must be one tangent 
to OBR. 


basket is revealed preferred to the other. In terms of figure 2, this 
situation (which we shall refer to as incomparability of the baskets) 
arises when the first period’s basket is on LO and the second period’s 
basket is on OK. 

Now suppose that the first period’s basket is observed to be on LO. 
In keeping with our earlier assumptions, our supposition is that the 
point (1,1) lies on LO. If the utility function had changed and the 
budget line had not, the new utility-maximizing market basket would 
have been at the point [1 + ¢, 1 — (A/B)t] on line LM. If this point 
happens to be on OM, then with the new budget line the utlity- 
maximizing market basket will be on OK, as is evident from figure 3. 

It follows that if we know that the first period’s market basket lies on 
LO and the point [1 + ¢, 1 — (A/B)t] lies on OM, we will observe an 


incomparability. 


98 JOURNAL OF POLITICAL ECONOMY 


Denoting the coordinates of 0 by Xo and Yo, we see that [1] + ¢, 1 — 
(A/B)t] is on OM if and only if 1 + ¢ 2 Xo. Since 


1 — ali@—» 
SF TApyee ge 
this is the case if and only if 
_— alu-D 
TAB ae Ue 


that is, 


i dd a Xo)(A/B) > qil(u-v 
Xo 


or—since 4/B = (Yy — 1)/(1 — X9)—YVo/Xy = a! = a”, where 7 is 
the elasticity of substitution between X and Y. 


Conclusion 


If, in figure 3, the first period’s market basket is observed to be on LO 
(i.e., outside the budget constraint of the second period), and if Yo/Xo 
is greater than a~", then we will observe an incomparability, provided 
X and Y are measured in units such that the first period’s market 
basket is at the point (1,1). 

Similar considerations show that if the second period’s market 
basket is observed to be on OK and if units are chosen so that its 
coordinates are (1,1), then an incomparability will be observed 
provided that X,)/Y) = a”. 


Remarks 


1. Because we shall be applying our model only in the two-good case, 
we shall have to assume that consumption of all substitutes and com- 
plements for the goods in question remains relatively constant (this is 
so in light of the discussion at the end of Section I). We shall approx1- 
mate this condition by requiring that the first and second periods be 
successive years. 

2. While the form of the utility function precludes the possibility 
that 7 = 1, we shall assume for ease of exposition that it is very close to 
one. (In Section IV we will explore the effect of letting 7 take on other 
values.) Thus we reinterpret the conclusion as saying that an incom- 
parability will be observed provided Yo/X9 or Xo/Yo, whichever is 
appropriate, is greater than a7!. 


The data which were used to construct figure 1 were reexam- 
ined, and values of Y)/X») or X,j/Y) were computed for every pair of 
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successive years and every pair of goods with crossing budget lines. 
The results, for total and per capita consumption, with and without 
the questionable housing data, are displayed in table 1 at the end of 
this section. (In these new studies, as in the earlier ones which in- 
volved more than two goods at a time, no intransitivities were ob- 
served.) 

Under total consumption there are 604 relevant cases (i.e., cases 
where the budget lines cross). Only nine incomparabilities were ob- 
served. More important, there are two cases in which an incompara- 
bility would have been observed if a~! had been less than 1.00. If we 
assume that tastes do change (i.e., that & is not identically one), then, 
by our assumption that & has median one, a~! must be less than one 
50 percent of the time; therefore the probability of our observing an 
incomparability in one of these two cases is greater than 1 — (1 — 0.5)? 
= 0.75. Thus our failure to observe an incomparability in either case 
allows us to reject the hypothesis of taste change with 75 percent 
confidence. The above remarks apply to total consumption with or 
without housing data. 

Under per capita consumption, there are 673 cases and only three 
incomparabilities. There are 14 cases in which an incomparability 
would have been observed if a7! had been less than 0.95. (There are 
eight such cases if one ignores the housing data.) Among these cases 
there is one incomparability. If we were to assume that a“! is less than 
0.95 even 25 percent of the time, the probability of its being so only 
once in 14 trials would be only 10.1 percent. Thus we can say with 
89.9 percent confidence (63.3 percent confidence if we do not use the 
housing data) that a“! is not less than 0.95 even as much as 25 percent 
of the time. 

In three cases, none of which involved housing data, X0/Yo or Yo/Xo 
was greater than 0.995. No incomparabilities were observed in any of 
these cases. Calculating as above, we find that this allows us to reject, 
with 57.8 percent confidence, the hypothesis that a~' is less than 0.995 
more than 25 percent of the time. 


Summary of Results. 


Using total consumption, a“! never differs from one (75 percent 
confidence). Using per capita consumption, a“! is not less than 0.95 
more than 25 percent of the time (89.9 percent confidence, 63.3 
percent without housing), and a’ is not less than 0.995 more than 25 
percent of the time (57.8 percent confidence). These results say, with 
some degree of confidence, that @ seems to have very small 
variance—and this is just what is meant by the assertion that tastes do 
not change. 


TABLE 1 


























WirH Housinc Data WitHout Housinc DaTA 
Xo/¥o 
OR No. of No. of Incom- No. of No. of Incom- 
MeN Cases parabilities Cases parabilities 
Total consumption: 
.00—.049 54 0 46 0 
.05—.099 40 0 Il 0 
.10—.149 43 0 32 0) 
.15-.199 39 0 27 0 
.20—.249 54 0 43 0 
.25—.299 45 0 33 0 
.30-.349 34 0 26 0 
.35-.399 30 0 Zi 0 
.40-—.449 2, 0 16 0 
.45—.499 28 0 22 0 
.50-.549 33 0 27 0 
.55-.599 20 0 15 0 
.60—.649 28 0 oT 0 
.65—.699 Do 0 14 0) 
.70—.749 25 0 20 0. 
.75—.799 21 0 ea 0 
.80—.849 20 1 E33 1 
.85—.899 13 0 11 0 
.90—.949 15 0 8 0 
.95-.999 15 8 10 7 
1.00 2 0 wa 0 
Total 604 9 455 8 
Per capita consumption: 
.00—.049 62 0 47 0 
.05—.099 44 0 38 0. 
.10—.149 52 0 40 0 
.15-.199 56 0 37 0 
.20—.249 51 0 39 0 
.25—.299 42 ) 30 0 
.30-.349 39 0 292 () 
.35-.399 a2 0 25 0. 
.40-.449 34 0 25 0 
45—.499 32) 0 23 0 
.50—.549 30 0 25 0 
.55—.599 26 0 21 0 
.60—.649 29 0 19 0 
.65—.699 29 0 21 0 
.70—.749 22 0. 17 0 
.75—.799 21 0 11 0 
.80—.849 24 0 20 0 
.85—.899 15 0 9 0 
.90—.949 19 2 13 1 
.95—.999 13 1 W 1 
1.00 1 0 ] 0 
Total 673 3 500 2, 
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It is important to note that the results listed above were derived 
without any reference to the fact that in all of our studies we never 
observed a taste change (the phrase “all of our studies” refers both to 
the set of studies described in Section II and to the 1,277 separate 
studies described in this section). This striking fact, which is really the 
main result of this paper, surely mitigates our disappointingly low 
levels of confidence, but I know of no way in which to quantify the 
extent to which it does so.” 


IV. Discussion 


Like most models, ours is laden with assumptions with which a reason- 
able man might take issue. Here we reconsider some of the points 
which are most open to question. 


1. The Form of the Utility Function 


It is certainly the case that our choice is arbitrary. However, utility 
functions of this form are often assumed in economics, and our 
results do indicate that we should be wary of models which assume 
such utility functions and also allow for changes in taste. Thus, even if 
we were to grant that our utility function is not an accurate reflection 
of economic reality, we would still be able to draw conclusions about 
the consistency of certain economic models, and this is a topic which 
should be of concern to economists. 

It should also be noted that, after some effort, I have not been able 
to find a “reasonable” utility function which yields results substantially 
different than does the one adopted here. 


2 One way to begin to quantify it is the following: Let py and py be the prices of X and Y 
in the 7th year and let 


p = Pitt 
pb 


(so that D is always less than one, because the goods were renamed so that the relative 
price of X fell). Then it can be shown that a taste change will be observed if and only if a 
< (X,/Y))D or a < (¥o/X,)D (depending on whether the first year’s basket is inside the 
second year’s budget line or vice versa). Now we could proceed as in the body of the 
paper, measuring values for (X9/Yo)D and (Y0/Xo)D instead of Xo/Yo and Yo/Xo. This 
procedure—which amounts to measuring the strength of our failure to observe a taste 
change by measuring the probability that a taste change could occur and go 
unobserved—seems more straightforward than the one adopted in the text, but when 
applied to the same data it yields numerical results which are substantially weaker. 
Given the assumptions of our model, this means that even if tastes changed substan- 
tially, our failure to observe a taste change in any of the 1,277 cases of Section HI would 
not be too surprising. However, this calculation ignores all of the findings of Section IT, 
which appear to be quite strong. I do not know the best way to quantify their 
strength—a naive generalization of the current model would require additional as- 
sumptions and very extensive computational resources. 
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2. The Exogeneity of Price Changes 


On the surface this is a very damaging objection. If prices and tastes 
do not change independently, as we have assumed, than a considerably 
subtler analysis of the problem is required. However, reasoning as in 
(1) above, we argue that the fact that many economic models do 
assume exogenous price changes implies that we can at least draw a 
conclusion about those models. The relevant question for economists 
is not “Do tastes change?” but “Are models which assume taste 
changes consistent with reality?” The latter is the question which we 
have really been addressing; the answer, for a large class of models, 
appears to be “no.” 


3. The Form of the Taste Change 


All that can really be said here is that the form which we have assumed 
seems reasonable and that other reasonable forms yield similar re- 
sults. If, for example, we had assumed that the new utility function 
was given by V (X,Y) = U(aX,Y), rather than being determined by the 
requirement that dV/dX = a(dU/dX), then our conclusion would have 
been exactly the same except that a!” would have appeared instead 
Oar? 


4. The Assumption That y Is Close to One 


Our calculation of the confidence with which we can say that a never 
differs from one (i.e., that tastes do not change at all) is unaffected by 
this assumption: It would remain the same for any value of 7. With 
regard to our calculation of the confidence with which we can say that 
a is not less than some given value more than a given percentage of 
the time, our result would be stronger for larger n and weaker for 
smaller 7. 


5. The Assumption That a Is Drawn Independently Each Time 


This assumption was essential when we calculated confidence levels in 
Section II. Our first calculation was that we could say with 75 percent 
confidence that @ never differs from one. The two cases involved in 
this calculation were transport and communication versus tobacco for 
1907-8 and fuel versus alcoholic beverages for 1937-38. The 30-year 
separation, together with the fact that entirely different goods are 
involved, seems to justify the assumption that a was drawn indepen- 
dently in these two cases. 

Our strongest numerical result involved the confidence with which 
we could say that @ was not less than 0.95 more than 25 percent of the 
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time; it was calculated to be 89.9 percent on the basis of 14 cases. The 
14 cases involved are listed in the unnumbered table below. 





1. Housing vs. food 1913-14 
2. Housing vs. alcoholic beverages 1942-43 
3. Housing vs. tobacco 1916-17 
4. Housing vs. tobacco 1943-44 
5. Housing vs. fuel 1951-52 
6. Household durables vs. tobacco 1922-23 
7. Household durables vs. fuel 1927-28 
8. Household durables vs. fuel 1929-30 
9. Household durables vs. transport and communication 1905-6 
10. Household durables vs. transport and communication 1909-10 
11. Alcoholic beverages vs. other 1909-10 
12. Household durables vs. other 1904-5 
13. Housing vs. other 1902-3 
14. Fuel vs. other 1901-2 








(Case 14 was the one in which an incomparability was observed.) The 
reader is invited to form his own opinion as to how many of these 
cases are really independent. If, for example, he discards cases 8 and 
10 on the grounds that they are too closely tied to cases 7 and 9, then 
we have 12 independent cases instead of 14 and our confidence drops 
from 89.9 percent to 84.2 percent. 


V. Conclusions 


The main result of this paper is that no taste change was observed in 
any of our studies. These included studies of total consumption and 
of per capita consumption, studies of broadly defined categories of 
goods and of very narrowly defined goods, studies of exhaustive and 
nonexhaustive lists of goods, and studies of as many as 107 and as few 
as two different goods. The two-good studies alone presented 1,980 
separate cases; 1,277 of these had crossing budget lines and hence the 
potential for an observed taste change. None of this potential was 
realized. In Section III we calculated levels of confidence with which 
we could say that tastes did not change by more than certain specified 
amounts; these calculations were independent of the fact that no taste 
change was observed and therefore greatly understate the strength of 
our results. 

The morals are the obvious ones: In economics, we should be wary 
of models which include variables for taste change, and outside eco- 
nomics we should be skeptical of warnings about the “wild vicissitudes 
of taste” and the constant danger of having our vicissitudes manipu- 
lated. With regard to this last point, it is particularly interesting to 
note that our study covers the period in which the electronic media 
came into being and into flower. 
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It is easy to postulate specific forms of taste change among the 
goods which we have studied, and it is easy to postulate forms which 
our methods could not have observed. It is considerably harder to 
postulate such forms which satisfy the additional restriction that they 
be reasonable. I do not believe that is possible, at least not without 
violating exogeneity of prices. If this is correct, then our results have 
strong implications for all price-theoretic models in which price 
changes are exogenous. 
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The Determinants of Tariff and Nontariff Trade 
Restrictions in the United States 





Edward John Ray 


Ohio State University 


This paper develops and tests a simple model for the determination 
of tariff and nontariff barriers to trade across industries within the 
United States, using 1970 trade data. We find that nontariff trade 
restrictions have supplemented tariff protection in the United States. 
Both tariff and nontariff trade restrictions are biased toward indus- 
tries in which the United States has an apparent comparative dis- 
advantage in world trade and away from industries in which con- 
sumer welfare losses from protection would be great. We also find 
substantial evidence that tariff and nontariff trade restrictions pre- 
dominate in industries with very different market characteristics. 


The purpose of this study is to synthesize a number of ideas expressed 
in earlier work into a simple analytical framework to explain the 
structure of tariff and nontariff barriers to trade across industries in 
the United States. Our central premise is that, subject to political 
constraints, trade restrictions are consistent with profit maximization 
across industries. Using International Trade Commission data for 
tariff and nontariff trade restrictions and industry characteristics for 
225 U.S. manufacturing industries in 1970, we find that such an 
approach has significant explanatory power. 

The resulting empirical analysis is unique in a number of important 
respects. First, we provide separate reduced form estimates of tariff 


The author is grateful to Betsy Radigan, Steve Cook, Ed Honton, and Jim Brown for 
aid in generating the Tobit and Probit results. Professor Tetsunori Koizumi, Jacob 
Frenkel, Howard Marvel, Donald Parsons, Jerry Thursby, Marie Thursby, Thomas 
Wolf, and, most particularly, J. David Richardson provided numerous helpful com- 
ments on earlier drafts of this paper. 
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and nontariff trade restrictions in the United States. Second, by ex- 
tending our analysis to include the estimation of a simultaneous 
model for the determination of tariff and nontariff trade restrictions, 
we provide empirical support for our hypothesis that nontariff trade 
restrictions have been utilized in part to compensate for internation- 
ally agreed-upon tariff reductions in the post-World War II period. 

The third important finding is that, while tariff and nontariff trade 
restrictions co-exist in industries with some common characteristics, 
there are some distinct differences in industry characteristics that are 
more strongly associated with one rather than the other alternative 
form of protection. Specifically, we find that both tariff and nontariff 
trade restrictions are biased toward industries in which the United 
States is at a comparative disadvantage in world trade, and away from 
industries in which losses in consumer welfare from protection would 
be particularly large. But there are some real differences too. Tariffs 
are biased toward industries which are low-skill intensive, away from 
industries that are capital intensive, and unrelated to product hetero- 
geneity, the concentration of production, and the geographical dis- 
persion of domestic production facilities. In contrast, nontariff trade 
restrictions are concentrated in industries producing fairly homo- 
geneous products using relatively capital-intensive techniques of 
production that, at the same time, are not intensive users of low- 
skilled labor. In addition, nontariff trade restrictions are found pre- 
dominantly in industries in which production is less concentrated and 
in which production facilities are distributed across regions of the 
United States in a fashion consistent with the distribution of popula- 
tion and, therefore, voting power in Congress. 


I. The Political Economy of Trade Restrictions 


The general approach of this paper is that the structure of tariff 
and/or nontariff trade restrictions across industries is consistent with 
the simple joint maximization of industry profits subject to political 
constraints. In effect, we assume that individual industry characteris- 
tics influence the profitability of trade restrictions and that political 
factors similar to those discussed by Cheh (1974, 1976), Pincus (1975), 
Baldwin (1976a), Caves (1976), Caves and Jones (1977), Helleiner 
(1977), Stone (1978), and others are critical in determining which 
industries are most successful in obtaining restrictions on imports of 
competitive products. 

The primary information to emerge from a simple analysis of the 
relationship between industry rents and protection is that the profit- 
ability of trade restrictions should normally be negatively related to the 
absolute value of the own price elasticity of demand for the product 
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and positively related to the foreign elasticity of supply of the prod- 
uct.’ In effect, the more responsive consumers are to price increases 
induced by trade restrictions and the less responsive foreign suppliers 
are to changes in the international price of the good, the less profit- 
able any given level of protection will be. 

Furthermore, a discussion of the profitability of protection assumes 
that imports represent a reasonable threat to sales in the United States 
by domestic producers. If concern for industry profits is important in 
explaining the structure of trade restrictions, we would expect trade 
barriers to be biased away from industries in which the United States 
has a comparative advantage in international trade because imports in 
such industries would be expected to be inconsequential and restrict- 
ing them would provide little benefit. We included a number of 
variables that previous studies have indicated are associated with U.S. 
comparative advantage or disadvantage in trade with the simple ex- 
pectation that trade restrictions should be found predominantly in 
product lines in which the United States is at a comparative disad- 
vantage vis-a-vis the rest of the world. 

Obviously, if there are positive terms of trade effects associated with 
protection sufficient to offset the deadweight consumption and pro- 
duction costs created by trade barriers, the government can generally 
increase aggregate social welfare and support for itself by restricting 
trade. But substantial terms of trade effects are not likely to exist for 
more than a handful of industries at present levels of protection. In 
general, increased protection would be expected to promote industry 
profits at the expense of the general public. In a broad sense, profits 
should be thought of as the sum of rents to entrepreneurs and to 
workers with firm- or industry-specific human capital and/or a strong 
union. The costs to the general public include artificially high product 


" Assuming that protection is less than prohibitive, that foreign supply depends only 
on the foreign price, and that the domestic market clearing price, Pp, is positively 
related to the degree of trade protection given, we can generate the following relation- 
ship between industry profits, II, and trade restrictions: 


all MCp Or | aP, 
Ae = 4 -P 
ear pal — 1) Dn + Srer(1— Pr seals: 


where Sp represents domestic supply, MCp is the domestic marginal cost of production, 
D is aggregate domestic demand, n = the absolute value of the own price elasticity of 
demand for the product, Sp, €r, and Pp represent the foreign supply, foreign supply 
elasticity, and foreign price of the product, respectively, and 7 represents the tariff or 
tariff equivalent rate. Details for this derivation are available upon request from the 
author. 

* There is, of course, the interesting possibility that the United States has a compara- 
tive advantage in some product and that trade restrictions are used to price discrimi- 
nate between the domestic and foreign market to maximize monopoly rents. That such 
a phenomenon could be systematic and significant across the 225 manufacturing 
industries we have data for is doubtful. 
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prices, misallocation of productive resources, and waste in terms of 
administrative costs of implementing and maintaining protectionist 
programs. 

In order to rationalize the existence of trade restrictions for 
numerous commodities in the United States, we assume that, up to 
some limiting degree of trade restraint, industry interest groups are 
more responsive to trade policy than consumers are.? One simple 
explanation of such an asymmetry in response to trade policy by 
industry interests and by consumers stems from the fact that infor- 
mation about market conditions is costly and imperfect and must be 
updated as market conditions change. To the extent that producers 
are less numerous than consumers and their well being (wealth) is 
more directly affected than the wealth of individual consumers by 
trade policy, they will invest more resources in keeping informed 
about and be more sensitive to trade policy changes affecting their 
industry. Clearly, the reverse could be true for industries in which 
buyers are more concentrated than sellers and the wealth effects are 
greater per consumer than they are per producer. But such cases are 
exceptional. This asymmetry in responsiveness does not imply that 
seller concentration need be positively related to protection. In fact, in 
his study of the structure of tariffs in Canada, Caves (1976) suggested 
that concentration could be negatively related to protection. Presum- 
ably, the argument would be that the spoils of protection will buy 
more votes the more widely they are dispersed. In each industry there 
may exist some critical value of seller concentration relative to buyer 
concentration, determined by the cost of market information which is 
required for the industry to obtain protection. And among protected 
industries there may be a negative association between protection and 
concentration, CONR. In short, the impact of seller concentration on 
the existence and extent of trade restrictions is ambiguous. 

So far we have argued that producers and workers in an industry 
have a greater economic incentive on average than consumers in 
general do to acquire costly information about the impact of trade 
restrictions on their prices, profits, and wages and in continuing to 
invest in such information gathering as market conditions change and 
old information decays or becomes less relevant. However, constant 
changes in trade restrictions could easily attract media attention and 
thereby provide consumers with relatively cheap and current infor- 


* Apart from the observation that nonzero trade restrictions exist for industries that 
enjoy no obvious terms of trade effects, we provide no direct test of our assumption that 
industry interests are more responsive than consumer interests to changes in trade 
policy. However, the notion that industry interests are weighted more heavily than 
consumer interests in determining the structure of protection is consistent with earlier 


empirical work by Ray (1974), Baldwin (1976), and Caves (1976) 


TRADE RESTRICTIONS 109 


mation regarding the impact that proposed changes in trade limi- 
tations would have on their welfare. Therefore, pressure to alter 
trade restrictions may follow a fairly discrete time path in order to 
permit industry interest groups to capitalize on the high cost and low 
quality of information available to consumers.’ Maintaining a given 
quota in the presence of expanding demand or foreign supply would 
be comparable to imposing higher and higher explicit tariffs over 
time. Conversely, if demand for a product is declining and/or foreign 
supply is shrinking, a quota is comparable to a declining explicit tariff. 
Therefore, assuming some stickiness in policy changes, quotas or 
other nontariff trade restrictions would have some relative advantage 
in expanding markets while tariffs would be somewhat advantageous 
for protecting declining product markets.® In our empirical analysis 
we will attempt to test the relative explanatory power of dynamic 
demand, AD, and supply, AS, conditions in tariff and nontariff trade 
restriction regressions. 

Finally, simple demand and supply analysis can be used to demon- 
strate that the deadweight loss associated with a given trade restriction 
will be positively related to the absolute value of the own price elastic- 
ity of demand for the product, y, and positively related to the domes- 
tic elasticity of supply, €p. To the extent that the government is 
strongly concerned with the overall welfare effects of trade restric- 
tions, tariff and nontariff trade barriers would be negatively related to 
the absolute value of the own price elasticity of demand for the 
product and negatively related to the domestic elasticity of supply for 
the product. 

We can summarize the discussion to this point by indicating the 
tariff and nontariff trade barrier regressions to be estimated, with the 
expected sign of the coefficient specified below each variable: 


Ea; Er, Ep, X1, asl =i Nas CONR, AD, AS) (1) 
a) Bs oo) scien tenes “Ve ks. Sahoo, Be aia ate 
and 
N = G(n, ep, Ep, X41; see A > Xa CONR, AD, AS, 7) (2) 
-,?,-,- -, ? +, +,+ 


Pee > SP ES! Sw, > > 


where N represents a measure of nontariff trade restrictions, the 
ambiguous sign on the elasticity of foreign supply, €,, reflects the fact 
that the sign on the foreign supply elasticity would only be positive in 


4 Similar views regarding the relative merits of tariffs and quotas in the context of 
dynamic market changes have been expressed by Kreinin (1970), Fishelson and Flatters 
(1975), Caves and Jones (1977), and others. 

> Cheh (1974) found some evidence that tariff cuts during the Kennedy Round were 
less vigorously applied to declining industries in the United States. 
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the noncompetitive case, €p represents the domestic supply elasticity, 
X,,...,X, are alternative measures of U.S. comparative advantage, 
CONR is a measure of seller concentration, and AD and AS represent 
the expected percentage growth in demand and supply at the time 
that trade restrictions are set (both are expected to be more positively 
significant in explaining nontariff than tariff protection). 

In the empirical section we treat the implementation of tariff and 
nontariff trade barriers as a sequential process. Tariffs predate non- 
tariff barriers in virtually every case, and nontariff trade restrictions 
are estimated as functions of both tariffs and of the political and 
economic factors we identify as important explanatory variables. 
Later we test for this sequential effect and find evidence that it does 
eXISt. 


II. Empirical Results 


The empirical results discussed in this section were generated from 
data made available by the U.S. International Trade Commission in 
1975. More precise definitions of the variables used in the regressions 
are given in the Appendix. The observations consisted of 225 four- 
digit manufacturing industries in the United States in 1970 and are 
specified in a separate appendix available upon request. 

Alternative forms for tariff regressions were estimated and are 
presented in table 1, while estimates of alternative specifications for 
nontariff trade barrier regressions are presented in tables 2 and 3. 
For regressions (1.1)—(1.3) in table 1, the dependent variable used was 
a weighted-average nominal tariff measure for each industry in 1970. 
In effect, the four-digit tariff rates were calculated by weighting the 
tariffs of less aggregated components by their import shares within 
the classification for the given four-digit industry. Regressions 
(1.4)—(1.6) differ from the earlier ones only in terms of the specifica- 
tion of the dependent variable. In regressions (1.4)—(1.6) the depen- 
dent variable is the four-digit industry nominal tariff rate calculated 
as the simple average of component nominal tariff rates. 

The dependent variable in table 2 is an Index of the Incidence of 
Nontariff Barriers in the U.S., 1970, constructed by the U.S. Tariff 
Commission.® In effect, the index measures the comprehensiveness of 


° The index was constructed by the U.S. Tariff Commission and is explained in Trade 
Barriers, Report to the Committee on Finance of the U.S. Senate, Part 2, pp. 160-72, 
Washington, D.C., April 1974. The quantitative restrictions included and the weighting 
scheme used were as follows: Bilateral Quota (0.91); Global Quota (1.96); Quota 
(unspecified) (1.36); Prohibited Imports (embargoes) (1.36); State Trading (0.91); 
Automatic Licensing (0.45); Liberal Licensing (0.45); Discretionary Licensing (0.91); 
Licensing (unspecified) (0.91); Minimum Price System (1.36); Seasonal Restriction 
(0.91); Restriction (unspecified) (0.91); Export Restraint (VERs) (1.36); Suspended 
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TABU EE2 


U.S. NoNTARIEF TRADE ResTrRictions (Tobit) 








DEPENDENT VARIABLE 





Index of the Incidence of Nontariff Barriers in the U.S.: 1970 











INDEPENDENT 
VARIABLES (2.1) (2.2) (233) (2.4) (235) (2.6) 
Constant 1.9] .26 6.68 DN 38 6.16 
(1.02) Ciiy (2.43) (0.11) (.16) (2.13) 
FTDNMOS site EE Se NOs oll SelOee ae I Slee 31 10 
(.54) (.38) (.46) (.03) 
CONR4L — .04 —.05 —.06 —.04 Sy —.06 
(1.92) (2.36) (2.86) (1.98) (2.44) (2.74) 
SKILLD —5.05 —6.78 —2.42 —6.26 —8.16 —6.04 
(1.13) (1.49) (.53) (1.42) (1.82) (1.37) 
SEINRD bes 4.99 2.14 nae 5.95 4.06 
(1.08) (.47) (1.29) (.89) 
ESCAL(67) 2.44 2.78 34 1.61 2.01 Smt 
(.88) (1.00) (.12) (.59) (73) (Gli) 
LABINT spe ae —12.69 aah ices —10.05 
(3.76) (3.07) 
KLRA Sate Slee Ome wears aati oll Se TKR wae 
(2.14) (2.02) 
WDAVPD Soe) —4.004 Sah) —3.76 =3.87 09 
(3.62) (3.63) (3.36) (3.49) (6253) (3.26) 
AIMP —.07 Sa = (08 —.08 —.06 ney 
(.75) (.56) (34) (.83) (.64) (50) 
ACON 1.40 1.34 .09 1B 1.33 24 
(1.01) (.95) COTE (E04) (.93) (.17) 
USWTTF .20 .20 .26 me ete he 
(2.97) (2.99) (3.72) 
USSMPTAR ae Snide ene 18 18 19 
(2.70) (2.70) (2.84) 
Norte.—158 of the 225 observations had the limit value of zero. 


nontariff protection in an industry. Fifteen types of nontariff trade 
restrictions were considered and assigned weights reflecting their 
relative effectiveness in limiting imports. Then the extent of nontariff 
protection given to each industry was calculated as the percentage of 
actual to potential protection to be derived from all 15 nontariff 


Import Restriction (0.91); Mixing Regulations (0.91). The weights on the right were 
determined by assigning each restriction a number from | to 3, high or low, depending 
on its degree of restrictiveness. The average of those numbers was divided into the 
assigned number for each restriction, resulting in the weights shown. Index values for 
industry categories were then obtained as follows: For any given commodity, the sum of 
its specific weights divided by the sum of weights for the 15 categories (11.35) ex- 
pressed in percentage terms yielded the percentage of maximum nontariff restraint 
of trade given to that industry. Total imports of major trading countries were used as 
weights to aggregate the basic product category data. Such a weighting means that 
restrictions in a heavily traded category were counted more than restrictions in lightly 
(world) traded categories. In cases in which the listed restrictions were known to apply 
to only part of a given product category they were arbitrarily counted only half as much 
as restrictions known to be applicable to an entire category. 
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TABLE 3 


U.S. Nonrarirr TRADE Restrictions (Probit) 








DEPENDENT VARIABLE 





Dummy Variable for Nontariff Barriers in the U.S.: 1970 








INDEPENDENT 
VARIABLES (3.1) (3.2) (353) (3.4) (3.5) (3.6) 
Constant .20 —.26 2.35 36 09 2250 
(.41) (.41) (2.75) (.72) (Gl) (2a) 
FTDNMOS Bee On Omeameela cai Oe? a = 56 10 gil se lO 
(.34) (1.20) (.45) (1.28) 
CONR4L —.01 = 0)i nt) eee ine —.01 = 02 
(1.38) (2.36) (3.10) (1.46) (2.41) (2.96) 
SKILLD —1.79 —2.77 —1,47 —213 —3.14 =2.51 
(1.52) (2.21) (USS) (1.84) (2.55) (2.01) 
SEINRD wd 2.68 1.86 2 2.84 aPD 
(2.21) (1.46) (E37) (1.80) 
ESCAL(67) SD) ao) = 1273 —.99 —.79 — 382 
(1.09) (.74) (1.96) (1.34) (1.00) (2.14) 
LABINT ite he —4.79 aati ; —4.01 
(4.75) (4.22) 
KLRA oa mile onl Cee ae ea Plena eee 
(2.95) (2.78) 
WDAVPD — 41 —.47 —.40 =, —.46 eae 
(2.53) (2.78) (2722) (2.39) (2.68) (2.16) 
AIMP —.27 —2.29 02 —.03 —i02 a2 
(1.05) (0.87) (.60) (1.07) (.92) (.69) 
ACON 2 48 .04 50 44 02 
(1.46) (1.29) (Get) (1.41) (1.19) (.05) 
USWTTF .06 .06 .08 Ae te ee 
(2.77) (2.80) (3.53) 
USSMPTAR oe pire ee 05 05 .O5 
(2.49) (2.51) (2.77) 
Pseudo R? Oy so 39 .24 30 35 
Likelihood 44.29 56.42 73.03 42.29 54.25 65.85 
Ratio 
Test* 





* For eqq. (3.1) and (3.4), a value above 18.50 for the likelihood ratio test is in the upper 1% of the distribution. For 
the remaining equations, a value above 23.2 for the likelihood ratio test is in the upper 1% of the distribution. 


restrictions. Industries with high percentage index values are then 
viewed as having received more comprehensive nontariff protection 
than industries with low percentage index values. The dependent 
variable in table 3 is simply a dummy variable which equals 1 if the 
index for nontariff barriers is positive and zero otherwise. In every 
other respect the equations estimated in tables 2 and 3 are the same. 
There are three different specifications for each of the tariff re- 
lationships and six different specifications for each of the nontariff 
relationships because we have taken an eclectic approach to the em- 
pirical explanation of U.S. comparative advantage. Within each subset 
of equations the alternative regressions differ only in their specifica- 


114 JOURNAL OF POLITICAL ECONOMY 


tion of the X,,..., X, variables alluded to in Section I and in the 
specification of the tariff variable in the nontariff trade restriction 
regressions. The various specifications of variables explaining U.S. 
comparative advantage are derivative from previous empirical works, 
including Stern (1964), Hufbauer (1970), Baldwin (1971), Caves 
(1976), Cheh (1976), and Stone (1978). The tariff regressions were 
estimated using ordinary least squares. Since only 67 of the 225 
industries had nonzero, nontariff trade restrictions, the nontariff 
trade barrier index equations and the nontariff trade barrier dummy 
equations were estimated using Tobit and Probit estimating tech- 
niques, respectively. 

International trade agreements and the decline in the importance 
of tariffs as a source of federal revenues have tended to induce the 
substitution of nontariff trade restrictions for tariffs in protected 
industries. The positive significant coefficient on the tariff variable in 
each of the nontariff trade restriction dummy equations indicates that 
nontariff trade restrictions are concentrated in industries with high 
nominal tariffs. Whether those nontariff trade restrictions were in- 
duced by tariff ceilings in highly protected industries, or not, is not 
clear. What is clear is that industries with the economic need and 
political clout to obtain high rates of tariff protection are also success- 
ful in obtaining nontariff trade restrictions. The use of nontariff 
barriers to trade in the United States apparently results in a greater 
variation in trade restraint across industries than one would suspect 
from looking at nominal tariff rates alone. 

The positive and significant coefficient on the tariff variable in each 
of the nontariff trade restriction equations in table 2 indicates that 
high tariffs and high rates of nontariff trade restrictions go hand in 
hand. In effect, nominal tariffs alone both understate the extent of 
variation in protection across industries and systematically understate 
the extent to which high tariff industries are being protected. 

To the extent that tariffs have been limited by domestic and inter- 
national political considerations as well as industry preferences for 
nontariff trade restrictions, we would expect the economic and politi- 
cal factors described in Section I to maintain significant explanatory 
power in the nontariff trade barrier regressions. The empirical results 
discussed below are consistent with that expectation. 

Hufbauer (1970), Baldwin (1971), and others have found that 
significant correlations exist between the ability of the United States to 
export products and the skill intensity, research and development 
intensity, and labor intensity of production. Our objective is not to 
replicate previous work but, rather, to introduce variables that have 
been employed as proxies for U.S. comparative advantage in previous 
work as control variables that should enter the tariff and nontariff 
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trade restriction equations with negative and significant coefficients. 
Confirmation of that expectation would be consistent with our 
hypothesis that protection is provided partly in response to industry 
pressure to reduce competition from foreign suppliers in the domes- 
tic market. 

The measures used as controls for comparative advantage include 
the first trading date, FTDNMOS; the percentage of scientists and 
engineers in research and development, SEINRD; and the percent- 
age of skilled workers in the workforce, SKILLD.’ 

Focusing on the results in tables 1 and 3,8 only the skill intensity 
measure has the expected negative and significant coefficient in the 
tariff and nontariff regressions. Numerous studies of trade flows have 
found that the United States imports relatively capital-intensive goods 
and goods which use low-skilled labor suggesting that protection 
would be biased toward industries with both characteristics. However, 
we find that tariffs are negatively related to the capital/labor ratio, 
KLRA, and positively related to labor intensity, LABINT, holding the 
skill coefficient constant, while nontariff trade restrictions are pos- 
itively related to the capital/labor ratio and negatively related to the 
labor intensity of production. 

One could speculate that, since recent additions to trade restrictions 
have been primarily in the form of nontariff barriers to trade, the 
negative coefficient on LABINT reflects an inability of nonunionized, 
low-skilled labor to obtain protection from imports while middle- 
skilled, unionized workers and producers in industries with heavy 
capital components have succeeded in obtaining nontariff trade re- 
strictions. Then the reverse signs on those variables in the tariff 
regressions would have to be attributed to the upper bounds on tariff 
changes that have been imposed in the Post—World War II period. At 
best, that explanation is strained. In any event, our results indicate 
that there is additional important work to be done in this area. 

As suggested in Section I, the impact of sellers’ concentration on 
trade restrictions is ambiguous. In his study of the structure of tariffs 
in Canada, Caves (1976) found a negative and significant relationship 
between Canadian tariff rates and the four-firm concentration ratio. 
While the concentration ratio enters the tariff regressions with in- 
significant and positively signed coefficients, concentration enters the 


‘Two other variables not reported on here: The number of scientists and engineers 
in research and development relative to total employment (SKLPC) and the wage rate 
for production workers (WGPRMH) were both insignificant and often positively signed 
in all of the trade restriction equations. 

* The index of nontariff barriers to trade is sufficiently subjective in nature and the 
results obtained in tables 2 and 3 are sufficiently similar that extended discussion of the 
results in table 2 would not be productive. 
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nontariff trade barriers regressions with a negative significant 
coefficient. 

Caves speculated that a negative relationship between trade restric- 
tions and seller concentration might reflect a bias by politicians to 
provide protection to industries with many producers in which the 
benefits would be widely dispersed. In order to test that hypothesis 
along with the hypothesis that protection has been sequential with 
nontariff trade restrictions supplementing predetermined tariffs, we 
ran the simple simultaneous model summarized in table 4. The results 
presented in table 4 are obtained from two-stage least-squares regres- 
sion models. The two models differ only in terms of the designated 
measure for tariff rates. In the first model, we used the weighted- 
average tariff measure. In the second model, we used the simple 
average tariff measure. In both models we used the simple 0-1 
dummy measure of nontariff barriers to trade. The results obtained 
in table 4 are qualitatively invariant with the choice of the subset of 
comparative advantage variables included. Therefore, for the sake of 
brevity, the estimators presented in table 4 are offered as represen- 
tative. The principal findings are that tariffs do positively and 
significantly affect nontariff trade restrictions, while nontariff trade 
restrictions have no significant impact on tariff determination, sup- 
porting our sequential model, and that nontariff trade restrictions are 
negatively and significantly related to both seller concentration and 
the geographical concentration of production, GEOG.° The negative 
sign on GEOG means that nontariff trade restrictions are biased 
toward industries in which production is distributed across regions of 
the United States in a manner similar to the distribution of population 
and therefore toward industries with substantial representation in 
Congress. 

We used a measure of scale economies in U.S. manufacturing in 
1967, ESCAL(67), as a proxy for the domestic, €p, elasticity of supply. 


9 GEOG is measured as follows: index value = 


xy VS; Pop; 
Z| gett oe ea 
> VS: Dire 
i=1 i=1 
where VS; = value of shipments in region 7,7 = 1,..., 4, and Poy, = population of 
region 7,7 = 1,..., 4. To the extent that production in industry 2 is distributed across 


the north, south, central, and western United States in a fashion similar to the general 
population, the index value will approach zero. The basic data for the index are from 
the Census of Manufactures for 1967. The lack of concordance between TCSIC and SIC 
data means that 27 observations were lost and that table 4 was estimated using 198 
rather than 225 observations. Apart from the fact that the inclusion of GEOG in the 
regression runs for tables 1-3 would have similarly reduced the sample size from 225 to 
198, the negative significance of GEOG in the nontariff dummy equations in table 4 
indicates that it should have been included in the earlier estimates as well. 


SIMULTANEOUS ESTIMATES OF TARIFF AND NONTARIFF 


TABLE 4* 


TRADE RESTRICTIONS IN THE U.S., 1970 








WEIGHTED-AVERAGE 


‘TARIFFS 


SIMPLE AVERAGE 


‘TARIFFS 





Dependent Variables 





U.S. Weighted- 


Dummy Variable 


U.S. Simple 


Dummy Variable 








Average for Nontariff Average for Nontariff 
Tariffs Barriers ‘Tariffs Barriers 
(4.1) (4.2) (4.3) (4.4) 
INDEPENDENT 
VARIABLES OLSQ Probit OLSQ Probit 
Constant 9.69 —2.65 16.02 —7.22 
(2.76) (1.34) (4.08) (2.08) 
FITDNMOS one —.18 x 107 ae eo oe all Ome 
: (1.61) (1.96) 
CONR4L .03 =.02 .03 Se 
(1.97) (2.81) (1.41) (2.91) 
SKILLD —23.52 10.80 —27.02 14.55 
(4.49) (1.97) (4.61) (2.15) 
SEINRD nee —0.60 ate On2 1 
(0.34) (0.13) 
ESCAL(67) —4.24 oes —4.83 Bare 
(1.04) (1.06) 
LABINT 10.19 —7.42 79 —.35 
(2.14) (4.02) (.15) (3.41) 
KLRA = 56 ee A Se KO —.2 x 1074 NB Se Or 
(1.77) (1.88) (1.56) (2.21) 
WDAVPD Se —.44 ts —.46 
(2.29) (2.38) 
AIMP -.9 x 10° —.29 
(33) (1.01) 
ACON 55 81 
(1.17) (1.56) 
PUSWTTF 52 ee 
(2.87) 
PUSSMPT AR? Pie .64 
(2.87) 
PDUM27 .82 Dif ee 
(.94) (.28 
GEOG 1.10 —.92 73 —1.53 
(.94) (2.00) (lea2) (2.55) 
R? 33 A .27 sr 
F-statistic Teo ve 10,25 ae 
Pseudo-R? BB a mol 
Likelihood 49.10 49.10 
ratio test 





* In converting the geographical dispersion index from SIC to TCSIC codes, 27 of the 225 observations had to be 
deleted for missing data. Consequently, the models estimated in table 4 are based on 198 observations. 

+ PUSWTTF, PUSSMPTAR, and PDUM2 are the predicted values of USWTTF, USSMPTAR, and DUM2 on the 
instrumental variables from the first stage of the two-stage least-squares estimation procedures. 


117, 


118 JOURNAL OF POLITICAL ECONOMY 


The discussion of Section I suggested that ESCAL(67) should be 
negatively and significantly related to tariff and nontariff trade re- 
strictions if the government is strongly concerned about the overall 
welfare costs of protection. ESCAL(67) appears in all of the weighted 
tariff and nontariff trade barrier dummy regressions with the ex- 
pected negative sign and generally significant coefficient. While the 
sign of ESCAL(67) is negative, it is less significant in the simple tariff 
regressions. 

In Section I we indicated that we expected to find a negative and 
significant coefficient for the absolute value of the own price elasticity 
of demand for the product in the trade restriction equations. On the 
assumption that the own price elasticity of demand is reduced by 
product heterogeneity, we used a measure of product differentiation, 
WDAVPD, as an inverse measure of the absolute value of the own 
price elasticity of demand. Consequently, we expected to find a posi- 
tive and significant coefficient on the product differentiation variable 
in the tariff and nontariff trade barrier regressions. The relative 
negative significance of WDAVPD in the nontariff trade barrier re- 
gressions compared with the tariff regressions may be partially re- 
lated to another factor that we have not discussed. Clearly, nonprice, 
quantitative restrictions are more easily applied and enforced if 
products are fairly standardized. Consequently, the negative 
coefficient on product heterogeneity may partially reflect an admin- 
istrative preference by the government for nonprice restrictions for 
homogeneous products. 

Earlier we argued that while market conditions change continu- 
ously, it may still make sense for industry interest groups to push for 
changes in trade restrictions in discrete fashion. If so, we would 
expect tariffs to be relatively more productive in industries that are 
expected to contract and nontariff trade barriers to be relatively more 
productive in expanding industries. We used the percentage growth 
in apparent domestic consumption between 1965 and 1970, mea- 
sured by the percentage growth in domestic shipments plus imports 
minus exports, as a proxy for demand shifts, AD, and the percentage 
growth in imports between 1965 and 1970 as a proxy for supply 
shifts, AS. Clearly, both the apparent:domestic consumption variable 
and the import expansion variable reflect both demand and supply 
phenomena and in that sense are poor proxies for the effects of pure 
demand and supply shifts that we are trying to measure. With that 
caveat in mind, we simply report that neither of the variables included 
to measure dynamic influences has any significant explanatory power 
in the tariff or nontariff trade barrier regressions. 

Unfortunately, we have no good measures of the industry-specific 
revenue effects associated with tariff and nontariff trade restrictions, 
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or of the administrative costs of implementing alternative forms of 
trade restrictions across industries. Consequently, all of our estimates 
suffer from the potential biases associated with important left-out 
variables. 


III. Conclusions 


We presented evidence that both tariff and nontariff trade restric- 
tions are found predominantly in industries in which the United 
States has no comparative advantage vis-a-vis the rest of the world and 
away from industries in which the deadweight losses to consumers 
from protection would be high. In addition, we presented direct 
evidence that nontariff trade restrictions have been used to supple- 
ment tariffs and, thereby, offset the trade-liberalizing effects of 
post-World War II tariff agreements. 

We also presented evidence that there are significant differences in 
the industrial characteristics of industries with tariff protection com- 
pared with those with nontariff trade protection. Specifically, tariffs 
are positively and significantly related to labor intensity and nega- 
tively and significantly related to the capital/labor ratio while just the 
opposite is true for nontariff trade restrictions. In addition, nontariff 
trade restrictions are negatively and significantly related to both seller 
concentration and geographical concentration in an industry while 
both characteristics are positively and insignificantly related to tariffs. 


Appendix 
Definitions of Variables 


Independent Variables 


FTDNMOS—Product cycle proxy: unweighted average of Schedule B first 
trade dates corresponding to TCSIC as of January 1974. 

CONR4L —Concentration Ratio, 1970: percentage of shipments accounted 
for by the four largest firms in the industry. 

SKILLD  —Skills measure, 1970: professional and kindred workers, plus 
managers and administrators (except farm), plus craftsmen 
and kindred workers, as a percentage of total employment. 
Based on three-digit SIC data with values repeated at four-digit 
levels. 

SEINRD —Percentage of scientists and engineers in R & D in 1970. Based 
on two-digit SIC data with values repeated at the four-digit 
level. ; 

ESCAL(67)—Economies of Scale measure, 1970: value of the exponent in 
the regression equation V = KN*, where V is the ratio of value 
added in plants employing N persons to average values added 
for the industry, and K is a constant. 
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LABINT —Labor Intensity Ratio, 1970 (payroll divided by value added). 

KLRA —Capital/Labor Ratio, 1970: Total capital stock divided by em- 
ployment (thousands of dollars). 

WDAVPD —Weighted Average of Product Differentiation measure, 1970: 
The measure is the coefficient of variation in the unit values of 
exports destined to different countries weighted by U.S. export 
shares, i.e., the standard deviation of U.S. export unit values 
weighted by export shares divided by means of unit values. 

AIMP —Percentage change in imports between 1965 and 1970. 

ACON —Percentage change in apparent U.S. consumption between 
1965 and 1970, where apparent U.S. consumption is measured 
by the value of U.S. shipments plus imports minus exports. 

GEOG —See n. 9. 


Dependent Variables 


Tariffs -USWI'TF—U.S. weighted-average tariffs for 1970 and USSMP- 
TAR—US. simple average tariffs for 1970 are taken from the GATT 
Tariff Study, Geneva, 1970. 

Index of Nontariff Barriers—Index of the Incidence of Nontariff Barriers in 
the U.S., 1970: The index was constructed by the U.S. Tariff Commission 
and is explained in Trade Barriers, Report to the Committee on Finance of 
the U.S. Senate, Part 2, pp. 160-72, Washington, D.C: April 1974" Seen: 6 

Dummy Variable for Nontariff Barriers in the U.S., 1970—DUM2—The 
value of the dummy variable was set equal to 1 if the index of nontariff 
barriers was nonzero for a particular product and zero otherwise. 
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Inflation, Corporate Income Taxation, and the 
Demand for Capital Assets 





Richard W. Kopcke 


Federal Reserve Bank of Boston 


The demand for capital is not systematically related to either the 
level or the rate of change of “effective” income tax rates on corpo- 
rate capital assets. Rising inflation during the last 10 years has raised 
the user cost of capital for durable assets relative to that for short- 
lived assets even though this inflation has raised effective tax rates 
for more durable capital less than for short-lived assets. Even with 
replacement-cost depreciation allowances, the level and pattern of 
investment incentives probably will continue to vary with the 
inflation rate. 


The past decade of high and rising inflation rates has encouraged the 
study of U.S. income tax codes and inflation’s influence on the de- 
mand for capital assets. Many have examined the “effective income 
tax rate” on capital to describe how inflation has altered both the level 
and pattern of investment incentives. In a recent article, for example, 
Auerbach (1979) observes: “With a positive rate of inflation, a historic 
cost depreciation rule biases the choice of asset life toward greater 
durability” (p. 621). Such a conclusion, based on the behavior of 
effective income tax rates, contradicts the spirit, if not the letter, of 
much traditional analysis of investment incentives and the demand 
for capital (see, e.g., Black 1959; Brown 1962; Samuelson 1964; Hall 
and Jorgenson 1967; Musgrave and Musgrave 1976; and Boadway 
1978) so it warrants examination. 

One principal conclusion of this paper is that the demand for 
capital is not systematically related either to the level or to the rate of 


The analysis and conclusions are not necessarily endorsed by the Federal Reserve 
Bank of Boston or the Federal Reserve System. 
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change of the effective corporate income tax rates on capital assets. 
Another principal conclusion is that the rising domestic inflation rate 
of the past 10 years not only has reduced investment incentives 
generally, but it has reduced the incentive to acquire more durable 
capital goods most severely. Even if U.S. corporate income tax codes 
were amended to embrace replacement-cost depreciation allowances, 
investment incentives would not necessarily be independent of asset 
service lives nor would the relative demand prices of capital goods 
necessarily be independent of the inflation rate. 


I. Inflation and the Demand for Capital 


Assuming perfect markets and certainty of foresight, investors 
maximize their wealth by purchasing capital yielding no less than p, 
the real discount rate for after-tax corporate cash flow. In other 
words, the real price of capital, P (the output price is the numeraire),! 
equals 


i el —z)e*Q, + rPD(t, d)e™ |dt + P - ITC, or (1) 
0 


Cia 
(p + 8) 


where 7 is the corporate tax rate, 6 is the rate of decay of capital, Q; is 
the marginal physical product of a unit of new capital, D(¢, 6) is the 
schedule of depreciation allowances per dollar of investment under- 
taken ¢ years ago, ITC is the rate of investment tax credit, Z is the 
present value of depreciation allowances per dollar of investment, 
and IT is the inflation rate. Formula (1) represents the present value of 
after-tax operating income plus the present value of the tax shelter 
afforded by depreciation allowances and tax credits; D(t, 5) is “dis- 
counted” by II because depreciation allowances are linked to the 
original purchase price of capital assets, so the real value of these 
allowances must decline as nominal capital goods prices rise. 
In equilibrium, then, 


Coot tp 0) 1 77,)/(1 — 7) = O;.,, 
CE =P {p+ 8,)(1 — 1Z, — ITC)/( = 1) = One, 


Ope TP Zar Glo. 


(2) 


or the user cost of capital (Hall and Jorgenson 1967)—CS for non- 
residential structures, CE for equipment—equals the marginal physi- 
cal product of capital. Because structures and equipment are each 


"For the most part, the notation matches that in Auerbach’s (1979) article. P may 
equal unity as it does in Auerbach’s article; however, it is often useful to relax this 
assumption because the relative prices of capital assets are not constant. 
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represented by a variety of capital goods bearing different rates of 
decay while offering different flows of capital services, A (6)Q,, equi- 
librium for both equipment and structures demand prevails when 


aes -V wim = Quis! Que: (3) 


Referring to the equilibrium condition described in expression (2), 
it is evident that a changing inflation rate, II, can alter the demand for 
capital. Higher inflation rates, other things equal, depress the present 
value of real depreciation allowances; this, in turn, raises the user cost 
of capital. From (3), it is also evident that if higher inflation rates do 
not increase the user costs of all capital goods proportionately, the 
pattern of investment demand will change as well. 

In equilibrium, 


CS(6:)_ _ A(61) 


Cy | AGS (4) 
(82) A(z) 


Should changing tax rates tend to increase CS(6,) relative to CS (6), 
P,(8,) and P,(6,) will compensate so that equilibrium is maintained. Of 
course, if the new balance requires the price of more durable struc- 
tures, P,(6,), to decline relative to that of less durable structures, 
P,(5,), and if the supply of structures varies with price, then the stock 
of nonresidential structures will become less durable. The influence 
of inflation on the demand price of capital and the choice of asset life 
depends on the sensitivity of the present value of depreciation allow- 
ances to changing inflation rates, 0Z/0II, as well as the importance of 
these/allowaneées as, a tax shelter,\(19— 72,5 0, ) 

Table 1 shows how rising inflation rates reduce the present value of 
real depreciation allowances for a variety of capital goods according to 
current U.S. corporation tax law. The table illustrates two important 
points. First, the value of depreciation allowances is lower for longer- 
lived capital assets. Second, with higher inflation, the value of depre- 
ciation allowances drops at a faster rate for more durable capital. 
Therefore, the modest role of depreciation allowances in the cash 
flow accruing to durable capital does not mean that these assets are 
necessarily less sensitive to changing inflation rates than are less dura- 
ble assets. 

In fact, as shown in table 2, the tax shelter offered by depreciation 
allowances is sufficiently important for long-lived capital that higher 
inflation rates increase the user cost of capital for these assets more 
than that of less durable assets. As the annual inflation rate rises from 
zero to 6 percent, other things equal, the investment incentives for 
40-year structures fall 22 percent, but the incentives for 20-year 
structures fall only 19 percent. Investment incentives generally drop 
much less for the tabulated categories of equipment. 


CORPORATE INCOME TAXATION 














125 
TABLE 1 
THE PRESENT VALUE OF REAL DEPRECIATION ALLOWANCES 
PER DOLLAR OF INVESTMENT 
A. PRopuceRs’ DurABLE EQUIPMENT* 
ASSET LIFETIME 
(Years) 

INFLATION RATE 
(%) 5 10 15 
0 44 43 .42 
2 42 40 38 
4 Al Eoui .34 
6 39 35 Sail 
8 O71 Pos. .29 
10 36 SSI De 
20 30 2S 19 
30 25 18 .14 


eee ee ee eee ee ee 


B. NONRESIDENTIAL STRUCTURES* 








ASSET LIFETIME 








(Years) 

INFLATION RATE 
(%) 20 30 40 
0 4] 38 36 
2 OD soil .28 
4 30 .26 a 
6 ih oD 19 
8 24 19 .16 
10 2 ot 14 
20 15 .10 .09 
30 sa .08 .06 





Nore.—Entries equal nominal depreciation allowances multiplied by .46, the marginal corporate income tax rate, 
discounted by (1.015)(1 + I), where 1.5 percent is the real after-tax risk-adjusted discount rate (see Brown 1962) 
and II is the inflation rate. 

* Nominal depreciation allowances are calculated using the sum-of-the-years’-digits formula. 

* Nominal depreciation allowances are calculated using the 150 percent—declining formula with a switch to 
straight-line allowances after one-third of the asset’s life has passed. 


Because rising inflation reduces the present value of depreciation 
allowances most quickly for the longest-lived assets, however, the user 
cost of capital for these assets becomes progressively less sensitive to 
further increases in the inflation rate. For example, if the inflation 
rate were to rise from 6 to 10 percent, investment incentives would 
drop another 5 percent for 40-year structures compared with an 
incremental decline of 6 percent for 20-year structures. Should the 
inflation rate exceed 20 percent, the erosion of real depreciation 
allowances would be so severe for the shorter-lived structures that 
longer-lived structures become relatively more attractive investments. 

Despite a popular contention that the value of the depreciation tax 
shelter, hence its annual loss in value due to inflation, is greatest for 
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TABLE 2 


INFLATION AND INVESTMENT INCENTIVES 


A. Propucers’ DURABLE EQUIPMENT 








ASSET LIFETIME 








(Years) 
INFLATION RATE 
(%) 5 10 15 
0 1.00 1.00 1.00 
2 -96 .94 .92 
4 93 .89 .86 
6 .90 .85 81 
8 87 81 .78 
10 .85 .78 215 
20 ail .70 .67 
30 Bie, .65 .63 





B. NONRESIDENTIAL STRUCTURES 








ASSET LIFETIME 








(Years) 
INFLATION RATE 
(%) 20 30 40 
0 1.00 1.00 1.00 
2, 91 .89 .88 
4 85 83 .82 
6 81 79 .78 
8 78 nO .76 
10 .76 .74 .74 
20 .69 .69 .70 
30 .67 .67 .68 





Nore.—Entries for each asset equal the reciprocal of the user costs of capital, expressions (2), divided by the 
reciprocal of user costs when the inflation rate equals zero. Values of Z, and Z, are taken from table 1, and, for 
equipment, /TC equals .1 except for 5-year assets for which JTC equals .067. The figures in this table are 
independent of the pattern of “true” capital decay—inflation alters the relative user cost through the factor (1 — 7Z 
=7TCy, 
less durable assets, it is the percentage change in the effective pur- 
chase price of capital, P(1 — rZ — ITC), that influences investment 
incentives. Table 2 shows that higher inflation rates raise this effective 
purchase price most for more durable, not less durable, assets for 
relevant inflation rates in the United States. The more durable the 
asset, the more its depreciation allowances are postponed, hence the 
more inflation may “discount” the value of these allowances before 
they may be claimed, as shown in the first table. Consequently, high 
inflation rates discourage the purchase of longer-lived capital more 
than short-lived assets.” Ironically, the class of investment goods most 


> This is true whenever the pattern of investment demand for high inflation rates is 
compared with the pattern of demand for inflation rates near zero. As explained in the 
text, however, as the inflation rate rises from an already high level, the relative demand 
price of some longer-lived assets may rise. 
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needing “protection” from the high inflation rates of the past 15 
years—nonresidential structures—has not received the benefit of the 
tax credits and accelerated depreciation granted to producers’ dura- 
ble equipment. 

This shift in the relative capital costs that inflation induces will alter 
investment demand to favor less durable capital goods, thereby de- 
pressing the relative demand price for more durable capital assets. As 
a result, the stock of more durable structures, other things equal, will 
fall relative to that of less durable structures. To the extent that 
structures and equipment are substitutes, the stock of structures 
generally will decline relative to that of equipment. Of course, the 
higher user cost for all capital goods implies a declining capital-labor 
ratio in addition to this substitution among varieties of capital assets. 

Assuming that the real price for all capital goods equals unity, a unit 
of output may be consumed or installed either as short-lived equip- 
ment or as a very durable structure; consequently, changing rates of 
inflation cannot affect asset prices. Rather, from expression (4), if 
CS (6)/A (6) for a more durable factory exceeds that for a less durable 
factory due to increasing inflation, investors will install less durable 
structures; more durable factories are now economically obsolete. 

In summary, with current U.S. income tax codes, which link depre- 
ciation allowances to the original purchase price of capital assets, 
changing inflation rates alter the relative user costs of capital, other 
things equal. Higher inflation rates initially raise these user costs more 
for long-lived assets than for short-lived assets, thereby reducing the 
average service life of the capital stock. 


II. Inflation and the Effective Corporate Tax Rate 
Auerbach (1979) defines the investor’s implicit discount rate as 
v =c — 6, (5) 
and then the effective corporate tax rate, 
CG = iy — op) i, (6) 


In view of expressions (2) and (3), it is possible for 6 either to rise or to 
fall with the service lives of capital assets. Consider, however, the 
simple case wherein A (6) equals unity and prices, P (4), are flexible for 
all assets (the definition of a unit of capital has eliminated the appar- 
ent differences in productivity among pure substitutes); then (from 
[4]), CS(6,) = CS(6,) and 0(8,) > 0(6,): The effective tax rate on the 
more durable factory is largest, though the user costs are equal. 
As explained in the previous section, a higher inflation rate will 
tend to raise the user cost for long-lived structures more than that of 
less durable structures. A higher inflation rate, then, will also increase 
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Qxs by reducing the stock of nonresidential structures, other things 
equal. Retaining the assumption that 4(5) equals unity, from (4) 
CS (8;) will still equal CS (6,) in equilibrium, and both will be greater, 
with higher inflation rates as long as capital goods prices are flexible. 
Though CS(6,) and CS (62) rise proportionately, 6(6,) increases less 
than 0(8,) because 6), 52, and p do not change. Even though the 
effective tax rate, 9, on less durable capital rises more than that on 
longer-lived assets, the capital stock is comprised of more short-lived 
assets with higher inflation rates because P,(61) (the price of the more 
durable asset) has fallen relative to P,(6:) in equilibrium, as described 
after expression (4). 

If A(8) rises with 6 and if the prices of all capital were unity, a 
medium-lived asset may have a relatively high user cost and yet be the 
asset in greatest demand due to its high productivity, A (6). Con- 
sequently, the effective tax rate, 6, for this asset may exceed that for 
many alternative assets not favored by investors. 

Assume, for 6, < 5, < 83, that 


Css GSO eC 
mim ee 


GS(6,) = GS (62) <= CS(63)2 


, and 


(7) 


It is reasonable to assume CS'(5) < 1 (see n. 3); therefore, 
6(8,) = [CS (8) — 6, — pI/[CS(8,) — 6,] > 
O(52) = [CS (82) — 82 — pI/[CS (8x) — 82] > (8) 
6(53) = [CS (63) — 63 — p]/[CS (53) — 4s]. 


The medium-lived asset is the optimal investment according to (7), 
but expression (8) reveals that its effective tax rate is not the lowest. 

Just as the level of 6 is a poor guide to investment incentives, so the 
change in the effective tax rate is a poor guide to the change in 
investment incentives. From (7), the short-lived asset (63) will be pre- 
ferred to the medium-lived asset once inflation is high enough to 
cause 


Ot) 
CS(S: aed (9) 


because O[CS (63)/CS (8,)]/dll < 0.4 





3CS'(6) = [1 — 7Z) — t(p + 8)Z'(5)]/(1 — 7). According to table 1, tZ'(5) ~ 2, so the 
first term in the brackets is approximately 0.6 and the second is approximately 0.3; 
therefore, it is reasonable to assume 0 < CS'(8) < 1. Similarly, CS"(6) < 0. 

4 As shown in table 2, unless inflation rates exceed 20 percent, a higher inflation rate 
raises the user cost for less durable structures (6,) proportionately less than the user cost 
of more durable structures (8,). The relative user cost of less durable structures declines 
as inflation rises. 
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Though rising inflation depresses the user cost of short-lived assets 
relative to that of longer-lived assets, however, it need not depress the 
relative effective tax rate on shorter-lived assets: 00(53)/OII is not 
necessarily less than 00(62)/0II, nor is 0[@(83)/0(8,)]/OI < 0 always. 
From expressions (7) and (8), for example, 


aes 
0 In [CS (6;)]/oll CS (83) [CS (53) — 82) 


~~ din (CS(B.)VO | ~CS(B)  (CS(&,) S&F 


The product of the last two factors in the right-hand side of (10) 
exceeds unity;’ consequently, the left-hand side of (10) may exceed 
unity—the tax rate on less durable assets may rise more than that on 
longer-lived assets—even though CS (63)/CS (65) is falling. Whenever 
6(53) rises more than 6(6,), 6(63)/@(62) also increases because 0(63) < 
6(6) in this example. While relative user costs for shorter-lived 
structures fall with rising inflation, the effective tax rate on these 
assets may rise more than that on longer-lived structures. 
Finally, from (2), (5), and (6), assuming P = 1, 


Pt) G77 FT C1 —7)6)(o.+ 8)1,.-(L1) 


Assuming that the intertemporal discount rate (p) is constant, the 
effective tax rate on corporate capital must vary with the inflation rate 
unless the statutory tax rate (7) equals zero or depreciation allowances 
are adjusted for changing capital goods prices—D (t, 6) is replaced by 
D(t, 5)e™ in (1)—so that 0Z/dII = 0; D(t, 6) may correspond to any 
capital consumption scheme, including “economic depreciation.” 

It is likely, however, that p changes whenever the inflation rate 
changes;® therefore, the demand for capital may change whenever II 


5 See n. 3. Because CS'(6) > 0, CS (63) > CS(6,). CS'(6) < 1, however, implies that 
[CS (8,) — 6.) > [CS (53) — 63]}*. Therefore, both of the last two factors of (10) exceed 
unity. 

° in many standard neoclassical growth models, combining intertemporal consump- 
tion and production decisions (e.g., Sidrauski 1967; Intriligator 1971), p by hypothesis 
is constant. In other versions of this model the intertemporal utility function is replaced 
by a saving rate and portfolio balance relationship (Tobin 1955, 1965). In these latter 
models, higher inflation rates can lower the real return on money balances, thereby 
encouraging investors to acquire more capital, by reducing p, until its marginal physical 
product drops enough to restore portfolio equilibrium. In other models, p is neither 
constant nor essentially equal to the real return on money. Inflation is a by-product of 
public and private policies intended to redistribute resources (Schumpeter 1939; 
Robinson 1954, 1962; Pasinetti 1962). This reallocation of wealth, income, or spending 
undoubtedly will change the social rate of time preference, p. Although Tobin’s growth 
model suggests p would fall as inflation increases, Schumpeter’s model suggests p would 
rise. It is not unreasonable, then, to assume that higher inflation depresses p and, in 
turn, increases the average service life of capital if corporate income taxes are indexed. 
Yet, so little is known about the social rate of time preference that it is unreasonable to 
ignore the alternative conclusion: Higher inflation rates accompany an increase in p, 
thereby depressing the average service life of capital. 
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changes, unless this demand is also independent of p. By setting EG 
= 0 and either setting 7 = 0 or allowing depreciation schedules that 
are a linear combination of “economic depreciation” adjusted for 
changing capital goods prices and immediate expensing of investment 
expenditures, not only is the effective tax rate independent of the 
inflation rate, but it is independent of the discount rate, p, and asset 
service lives. (Even if depreciation allowances were “indexed” for 
inflation, unless these allowances correspond to the pattern of an 
asset’s “economic depreciation,” the effective tax rate will not be 
independent of the asset’s service life [see Samuelson 1964].) Of 
course, these last reforms accomplish this goal by pegging the effec- 
tive tax rate at zero. Nevertheless, even though 6 = 0 for all values of 
II, 5, and p, these reforms cannot make the user cost of capital 
independent of p and 6 because (from [2]): 


C =(p + 8). (12) 


The demand for capital, therefore, is not systematically related 
either to the level or to the rate of change of the effective corporate 
tax rate defined by expressions (5) and (6). The demand for capital 
depends on the user cost of capital and the marginal physical product 
of capital alone in neoclassical macro models, so the income tax codes 
influence investment incentives only by altering capital’s user cost. For 
studying the demand for capital, then, the suggested effective corpo- 
rate tax rate, 0, is not a sufficient statistic—it alone is not a definitive 
measure of investment incentives. Conversely, the user cost of capital 
does not measure the effective tax burdens borne by the various 
capital assets. Each concept—the cost of capital and the effective tax 
rate—serves a particular purpose, and their roles are distinct. 


II. Summary 


Unless the aggregate rate of time preference has declined substan- 
tially, the rising inflation rate of the past 10 years not only has reduced 
business’s demand price for capital generally but it has reduced the 
relative demand price for more durable assets most severely. Even if 
the corporate tax rate were zero or depreciation allowances were a 
linear combination of economic depreciation adjusted for changing 
capital goods prices and immediate expensing of investment ex- 
penditures, thereby pegging the effective corporate income tax rate 
at zero, the level and pattern of investment incentives most likely 
would vary with the inflation rate. 

Of course, it is one thing for inflation to influence investment 
incentives directly; it is quite another for changing investment incen- 
tives to accompany a changing inflation rate because the intertem- 
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poral discount rate varies with the inflation rate. Although it is prob- 
ably unreasonable to expect any tax reform to insulate the demand 
price of capital from the influence of policies that reallocate resources, 
any schedule of depreciation allowances adjusted for changing prices 
can insulate investment incentives from inflation per se. 
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This paper develops and estimates a model of the joint determina- 
tion of the exchange rate, international reserves, and the rate of 
inflation under a crawling-peg system. The framework presented, 
which is an extension of previous work on the monetary approach, 
generates short-run deviations from purchasing power parity that 
occur simultaneously with movements in both international reserves 
and the exchange rate. The model is estimated by full-information 
maximum likelihood on the basis of quarterly data for Brazil. 


With the advent of the modern literature on the monetary approach 
to the balance of payments and to the exchange rate, and following 
the seminal contributions of Mundell (1968, 1971), Johnson (1972), 
and Dornbusch (1973), a large amount of effort has been devoted to 
emphasizing the role of the money market and asset markets in the 
determination of the balance of payments and the exchange rate. The 
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long-run effects of monetary shocks in a regime of fixed exchange 
rates, the effects of devaluations and changes in commercial policy in 
a monetary setting, and the short-run consequences of monetary 
imbalance on the inflation rate and the balance of payments have 
recently been the object of careful theoretical and empirical study. 
The importance of monetary variables in the determination of the 
exchange rate in a regime of floating rates has also been analyzed in 
detail.! 

Most of these studies, however, have dealt either with cases in which 
the exchange rate has been kept fixed for very long periods or with 
free-floating cases in which the exchange rate is predominantly de- 
termined by the interaction of market forces without government 
intervention.” 

Yet in recent years many countries have experienced simultaneous 
fluctuations in their exchange rates and international reserves. These 
fluctuations have occurred either under a managed-float system, 
characterized by government intervention in the foreign-exchange 
market, or under a crawling-peg system, in which the authorities 
determine, and periodically change, the country’s exchange rate.? In 
order to analyze the experience of countries with these characteristics, 
an extension to the previous formulations of the monetary approach 
is required. 

In this paper we develop and estimate a model for the analysis of 
the joint determination of the exchange rate, international reserves, 
and the rate of inflation under a crawling-peg system. Section I 


‘A number of important contributions on the topic are collected in Frenkel and 
Johnson (1976, 1978) and in the International Monetary Fund (1977) volume. A review 
of the empirical evidence on the monetary approach to the balance of payments and 
exchange rates is presented in Magee (1976). Johnson (1977) and Whitman (1975) 
present comprehensive reviews of the recent contributions to the monetary-approach 
literature, while a critique of some of the central aspects of that approach is found in 
Hahn (1977). All these deal with fixed-exchange-rate regimes. Among those dealing 
with floating rates are Kouri (1975), Dornbusch (1976), Frenkel (1976), Mussa (1976), 
and Frenkel and Clements (in press). 

* Exceptions are the studies by Artus (1976) and by Girton and Roper (1977). The 
latter develop and test a monetary model of exchange-market pressure, which is 
defined as a composite variable that includes changes in both the exchange rate and 
international reserves. This model was recently applied to Brazil by Connolly and da 
Silveira (1979) who, using annual data, attempt to explain the behavior of the 
exchange-market pressure variable. As will be seen, our model is designed to deal 
simultaneously with the joint determination of each of the two components of 
exchange-market pressure as well as of the domestic rate of inflation. 

3 A theoretical analysis which concludes that the optimum exchange-rate regime will 
correspond to neither of the extremes of a completely fixed or a completely flexible rate 
is presented by Boyer (1978) and Frenkel (1980), who concludes that “for the purpose 
of empirical work it is useful to design a framework of the adjustment mechanism that 
can accommodate simultaneously changes in international reserves and changes in the 
exchange rate.” 
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presents the basic model and uses it to study the effect of changes in 
domestic monetary policy on the evolution of these three endogenous 
variables. The empirical implementation of the model, presented in 
Section II, uses quarterly data from Brazil covering the crawling-peg 
period, 1968-77. The parameters of the model are estimated simul- 
taneously using a full-information maximum-likelihood method. 
Concluding remarks are presented in Section III. 


I. The Model 
Monetary Equalibrium 


The model is a variant of the monetary approach to the balance of 
payments. Its main feature is that it accounts for different degrees of 
exchange-rate flexibility (from a freely floating parity to a completely 
fixed exchange rate). We take the case of a small country, defined as 
one whose international price of traded goods is exogenously deter- 
mined, and we allow for the existence of “nontraded” goods, defined 
as goods whose price responds, at least in the short run, to domestic 
monetary variables. 
The basic relationships of the monetary sector are 


M, =a(kh + D), (1) 
Ma = Pma, (2) 
ma =f, 7°), >) 


where M, is the nominal supply of domestic money; a is the money 
multiplier; R is the foreign-exchange reserves held by the central 
bank; D is the domestic-credit component of the monetary base; Mq 1S 
the demand for nominal cash balances; P stands for a price index that 
includes traded and nontraded goods; and mq is the real demand for 
money, assumed to be a function of real income,‘ y, and of the al- 
ternative cost of holding money, proxied by 7°, the expected rate of 
inflation.°® 

Although it is possible to postulate a mechanism of lagged adjust- 
ment, as is done in the empirical section, here we assume for 


4 We assume that the rate of growth of real income is exogenous with respect to 
monetary variables (we therefore assume a version of the natural rate hypothesis). 
Although restrictive, this assumption allows us to focus on the specific effects of money 
in an open economy in the context of the monetary approach. At the empirical level we 
found evidence supporting our assumption for the case under consideration. A com- 
plete report of the relevant causality tests is available from us on request. 

> As it happens, the results below are valid under a large variety of assumptions about 
the mechanism for the formation of 7°, provided that the latter depends on lagged 
variables. In the empirical applications, we assume that 7° is formed by a version of 
rational expectations; see Sec. II. 
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simplicity that the money market clears in each period so that the 
nominal stock of money is equalized, ex post, with the demand for 
nominal balances. The assumption requires the existence of the flow 
equilibrium 


M} = Mi, (4) 


where the asterisk indicates the percentage rate of change of the 
variable. Differentiating equations (1) and (2) logarithmically, the 
flow-equilibrium condition can be rewritten as 


a* + (1 — y)R* + yD* = P* + mi, (5) 


where y is a factor of proportionality equal to D/(R + D) and P* is the 
domestic rate of inflation. 


The Domestic Rate of Inflation and the Balance of Payments 


When traded and nontraded goods are both present, the domestic 
rate of inflation can be measured as a weighted average of the rate of 
change of the price of both kinds of goods, 


P*¥ = \P# + (1'— A) PR (6) 


where Py, Pyz is the price in domestic currency of traded, nontraded 
goods, respectively, and A is the share of traded goods in total ex- 
penditure. In a small economy P# is determined by the world rate of 
inflation (P*) and by the variations in the effective exchange rate (p*): 


P% = P* + p*. (7) 


The price of nontraded goods, however, may be affected by 
domestic factors, at least in the short run. Since an ex ante excess 
supply of money implies excess demand in the goods market, and if 
excess demand for nontraded goods varies monotonically with excess 
demand throughout the economy, we can expect the price of non- 
traded goods relative to that of traded goods to be a function of 
imbalance in the money market, 


P%, = P& + 00, (8) 


where 1) is a measure of monetary imbalance and @ is the elasticity of 
the relative price with respect to the monetary variable.° 
An important feature of equation (8) is that it includes only con- 


6 The elasticity 6 is a function of the elasticity of substitution between traded and 
nontraded goods in consumption and production and of the income elasticity of the 
nontraded goods. For a detailed description of the dynamics of domestic-price deter- 
mination in a monetary model with traded and nontraded goods, see, e.g., Parkin 


(1974) and Blejer (1977). 
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temporaneous variables. In practice, however, it may well be that PXr 
depends on lagged P# and © reflecting persistence effects. To allow 
for these effects, it is possible to postulate a relationship of the form 


(Pk — P#): = 0, + a(PXr — P#)e1 (9) 


where ¢ is a time subscript and a is the adjustment coefficient. Empiri- 
cally, however, we found that our results were quite insensitive to the 
particular specification adopted. We, therefore considered equation 
(8) as the relevant representation for relative-price determination. 
The structure of the model resulting from equation (9) and the results 
obtained using that specification are presented in the Appendix.’ 

As regards the measure Q, it is important to remember that a 
central conclusion of the monetary approach to the balance of pay- 
ments is that in a small open economy the nominal supply of money 
may be beyond the control of the monetary authority. Under fixed 
exchange rates the monetary authority can only determine the ex ante 
quantity of money by changing the domestic-credit component of the 
base or by manipulating the money multiplier. In conjunction with 
the flow demand for real balances generated by adjustments in the 
desired stock, such measures create an ex ante excess flow supply of 
money to which the public reacts by changing the level of the interna- 
tional reserve component of the base through the balance of pay- 
ments and by affecting the rate of domestic inflation. The ex post 
nominal quantity of money in an open economy is then influenced by 
the public’s response to ex ante conditions in the money market.* 

It appears, therefore, that the relevant measure to account for the 
monetary effects on the goods market in an open economy should be 
an ex ante measure which does not include the endogenous reaction 
of the foreign component of the base. For that reason we define © in 
equation (8) as the gap (in percentage terms) between the ex ante 
change in the money supply (i.e., a change in the domestic-credit 
component of the base and in the money multiplier)® and changes in 
demand. Equation (8) can therefore be rewritten as 


Py» = P& + 0(yD* + a* — M4). (10) 


7 Eq. (8) ignores possible real sector effects on the relative price of nontraded 
goods (e.g., real wages, income policies). Although the equation can be modified to take 
these factors into account, this would result in a framework far beyond our intended 
extension of the monetary approach to the case of a crawling peg. 

* For time-series analysis of European data and econometric tests of the interaction 
between changes in domestic credit and in international reserves implied by the mone- 
tary approach, see Blejer (1979) and Leiderman (1980). 

° Empirical examination of the assumption of exogeneity of (yD* + a*), on the basis 
of causality tests, indicates nonrejection of the assumption; complete test results are 
available from us on request. 
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Substituting (7) into (10) and then (7) and (10) into (6), we obtain, 
after some manipulation, the following expression for the rate of 
domestic inflation:'° 


PS e(Py wp) 1(l— e)(yD* — m4); (11) 


where € = [1 + 6(11 — A)". 

In addition to changes in the price level, there are also changes in 
international reserves operating to restore monetary equilibrium. An 
expression for the money account of the balance of payments, which 
is equal to the change in the international reserves held by the central 
bank, may therefore be obtained by substituting P* in equation 
(5)—the flow equilibrium condition for the money market—for its 
value in (11) and rearranging the terms: 


(eye eet pag yD"). (12) 


Equations (11)—(12) present the domestic rate of inflation and the 
balance of payments as functions of world inflation, exchange-rate 
policy, and the rate of ex ante excess flow supply of money. When 
nontraded goods are absent (A = 1) or when their price is not sensitive 
to monetary imbalance (@ = 0), then € = 1, and the model is similar to 
the classical long-run formulation of the monetary approach (see 
Johnson 1972). In such a case, domestic monetary variables do not 
affect the domestic rate of inflation which, if the exchange rate is not 
altered, is fully determined by the world rate, and every ex ante 
monetary shock will lead to reserve depletion due to a balance-of- 
payments deficit. 


The Endogeneity of the Exchange Rate in a Crawling-Peg System 


Except in a fully flexible exchange-rate system (or in a managed float), 
the exchange rate is regarded by governments as a policy instrument, 
and its fluctuations are generally influenced by policy decisions aimed 
at one or more goals. Unlike under an adjustable-peg regime, under a 
crawling-peg system the exchange rate is changed frequently accord- 
ing to some set of rules adopted in order to attain the government's 
objectives.'' The variation of the exchange rate can therefore be 
considered as following a sort of reaction function which reflects 
policy goals as well as the parameters of the model adopted and the 


10 Assuming, for simplicity, a constant money multiplier, i.e.,a* = 0. In the empirical 
section, however, changes in the money multiplier are explicitly considered. 

1! A number of proposals for the operation of the crawling peg as well as analyses of 
the stability of the system have recently been presented in the literature; see, e.g., 
Williamson (1966), Cooper (1970), Kenen (1975), Levin (1975, 1977), and Mathieson 
(1976). 
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values of the exogenous and endogenous variables considered rele- 
vant for the desired goals. The goals, and therefore the crawling 
rules, may vary from country to country.” 

With a view to the empirical implementation of the model for 
Brazil, we postulate here that the policy objective is to avoid long-run 
changes in the real exchange rate and that the nominal rate (p) 1s 
therefore altered to maintain purchasing power parity.’ We assume, 
in addition, that the full adjustment of the exchange rate may take 
more than one period, and we shall analyze the effects of differences 
in the speed of exchange-rate adjustment. The reaction function 
implied by the policy rule 1S 


pit = 6 > (1 — BYLY(P* — Pé). (13) 


where ¢ is a time subscript, 8 indicates the portion of the current 
differential rate of inflation transmitted to the exchange rate in the 
current period, and L is the lag operator (Lx Xpea 

This formulation specifically assumes that, in addition to the 
current-period adjustment, the exchange rate will continue to change 
in each subsequent period by a fraction 8 of the still unadjusted 
differential until the whole differential has been transmitted to the 
exchange rate. Obviously, the greater is B, the faster will the rate 
adjust. If 8 = 1 our model does not differ conceptually from the 
monetary-approach model of free-floating exchange rates, since the 
rate of depreciation is then fully determined by the domestic-foreign 
inflation differential (see Frenkel 1976). 


The Functioning of the Model 


We proceed now to solve the model for the three endogenous vari- 
ables in which we are interested: the rate of inflation, the rate of 


2 Kenen (1975) analyzes in detail the relative efficiency of a number of alternative 
sliding-parity rules. Mathieson (1976) studies the consequences of using a welfare— 
instead of a balance-of-payments—objective as the guideline for the crawl. 

‘3 The appropriateness of this assumption for Brazil is discussed in Sec. II. In a 
previous version of this paper a number of alternative rules were incorporated into the 
model, among them maintaining a given level of nominal reserves (R* = 0) and 
maintaining a given level of real reserves (R* — P% = 0). Although the dynamics of the 
system change with the policy rule, its basic structure is not affected. For presentational 
convenience the functioning of the model is here confined to a single rule. 

4 A reaction function of this type follows Dean (1974) in the sense that endogenous 
target variables are a function of the current values of other endogenous variables; this 
is justified on the grounds that the latter are a plausible representation of the structure 
of the model and may therefore provide prior knowledge of the structural-form 
coefficients. This differs from the approach of earlier works such as Friedlaender 
(1973) where endogenous variables depend only on exogenous or lagged-endogenous 
variables. 
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change of foreign-exchange reserves, and the rate of change of the 
exchange rate. Combine (11) and (13) to solve for p¥, 


= ple) Soe 
gee BE Oe PD (14) 
and substitute this in (11) and (12) to obtain 
pee Cat) ie) 
Been rae (15) 
Ol Sree] Ut BE) 2 
"T= fe= Ua pe me 
and 
= By ere BUR DNS -_ 
ee eiraes SEA ea OND (Pi + mq — yD*),. (16) 


The three endogenous variables are functions of current and 
lagged values of foreign inflation and domestic monetary variables. 
We can consider now which type of monetary policy will equalize the 
domestic rate of inflation to the world rate. Because the coefficients of 
P* and (yD* — m*) in equation (15) add to unity, P* will equal P* when 
the monetary authority expands the money supply at the rate neces- 
sary to satisfy the growth in real demand and to replace the depre- 
ciated value of the nominal stock. This is achieved when the exoge- 
nous component of the supply of money (domestic credit) expands at 
a rate that exceeds the growth in the demand for real balances by the 
world rate of inflation, that is, when the ex ante excess flow supply of 
money is equal to the world inflation rate, 


(yD* — m%) = Pi, (17) 
which implies 
P* = Px (18) 
and 
p* =R* = 0. (19) 


To illustrate the functioning of the model further we can analyze 
the effects of a domestic-credit shock. To do so, it is convenient to 
consider an economy whose initial equilibrium satisfies equations 
(18)—(19). Starting from such an equilibrium position, and as long as B 
and € are smaller than unity, an acceleration in the rate of expansion 
of the domestic-credit component of the monetary base, A(yD*), will 
cause the rate of domestic inflation to depart from the world rate 
(here assumed constant), the exchange rate will rise, and a balance- 
of-payments deficit will be created. 
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If the rate of growth of domestic credit is sustained at the new, 
higher level, the exchange rate will eventually adjust to account fully 
for the differential in the inflation rates, which will then be equal to 
A(yD*).'5 Once the changes in the exchange rate fully reflect the 
differential rate of inflation, the excess supply of money created by 
the government in each period is fully eliminated through an increase 
in domestic prices, and no further flows of foreign exchange reserves 
occur. This process is illustrated in figure 1 for different values of B 
and «.'6 As B increases, for given values of e, the exchange rate adjusts 
faster to the monetary shock, and domestic inflation both diverges 
faster from world inflation and converges faster to its new equilibrium 
rate. As can be observed by comparing paths III and II in fioureslethe 
balance of payments deteriorates less and returns to equilibrium more 
rapidly (implying a smaller total loss of reserves) as B increases. When 
8 — 1, the system approaches a flexible-exchange-rate model, like the 
one presented by Frenkel (1976), in which domestic inflation is always 
independent of world inflation and the balance of payments is always 
zero. Letting B = 1 in equations (14)—(16), we obtain 


Ca Daa). (20) 
pr GDs= mz = Pp (PY ae (21) 
R¥ = 0. (22) 


The speed of adjustment will also increase, for given values of B, 
the lower the e, that is, the higher the share of nontraded goods in 
expenditures, A, or the higher the elasticity of relative prices with 
respect to monetary imbalance, 6. The more rapid adjustment of the 
rate of exchange when € falls (which also implies a faster acceleration 
of the rate of inflation although a smaller loss of reserves during the 
adjustment process) is illustrated by the comparison of paths II and I 
insfisure 1: 

Thus the results indicate that under a crawling-peg system a small 
country can choose its own long-run rate of inflation independently 
from the rest of the world. However, balance-of-payments deficits 
and surpluses are experienced in the process of adjustment, owing to 
purchasing power disparities which arise because the exchange rate 
takes some time to adjust fully. 

We turn now to the analysis of the empirical results obtained by 
applying the model described above to the case of Brazil. 


's That is, the difference between the new rate of ex ante excess supply of money 
[yD* + A(yD*) — mj] and the world rate of inflation. 

18: For pero! simplicity the figure does not consider the effects of the tempo- 
rary reduction in mj as in flationary expectations accelerate. The pattern of adjustment 
is, however, very similar when these effects are taken into account. 
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(pD*-mg),+4pD* p* 


RATE OF CHANGE 


AyD” =aP* 





(pD*- mg) = Py = P* 


ApD* = Ap* 








Fic. 1 


II. Empirical Investigation 


Estimation Procedures 


In this section, the model developed above is restated in econometric 
form. For estimation purposes, we work with the following system: 


log m, = 6, log y, — d27¢ + 8; log my_, + Mit, (3') 

P# = €(P + p*), + (1 — (a* + yD*— mB) + px, (11) 

(1 — y)R# = €(PS + p* + m¥— yD* —a*) + ys, (12) 

pt = B(P* — P®), + B(L — B)(P* — PB (13) 


pop lB) PS Piha I Mar. 
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Equation (3’) is the empirical counterpart of the model’s demand for 
real money balances. The specification is semilogarithmic, in that the 
log of real money balances depends on the log of real income and 
expected inflation. As the model is applied to quarterly data, we 
postulate a process of gradual adjustment of real balances to their 
optimum value, and that is the rationale for the appearance of lagged 
money balances in the equation.” Equations (11')—(13’) are stochastic 
variants of equations (11)—(13) above."® 

The following stochastic specification is adopted: It is assumed that 
wu; has a first-order autoregressive representation bi = Pili + Vit 
(for? = 1, 2, 3, 4), where @¢; is the autoregressive parameter, and vj is 
the error term. Furthermore, we assume that the error vectors, V4;, V2; 
Uy, Ve (fort = 1, ...,7) have zero mean and a constant variance- 
covariance matrix I’; also, they are serially uncorrelated and drawn 
from a multivariate normal distribution. 

Equations (3') and (11’)—(13’) constitute a simultaneous system, 
which is linear in the variables but nonlinear in (some of) the param- 
eters. Given these characteristics, consistent estimation requires the 
use of some simultaneous-equation method. We have used a full- 
information maximum-likelihood (FIML) estimator; FIML utilizes all 
the a priori restrictions on the system to estimate the coefficients of 
interest (in our case, €, B, ¢;, 6;) by maximizing the model’s likelihood 
function.!9 

To make the estimation operational, a time series of m3, the rate of 
change of real money demand, is required. As money demand is 
simultaneously estimated with the other equations of the model, the 
complete system is estimated under the restriction that m3, corre- 
sponds to the first difference of the fitted values of equation (3’). In 
addition, note that the estimation of money demand requires a proxy 
for 7°. This proxy is constructed by assuming a version of the rational 
expectations hypothesis. Specifically, it is postulated that agents form 
their inflation expectations on the basis of the least-squares prediction 
of inflation, conditional on the set of available information. We have 


‘7 Formally, the demand for money is given by log ma, = 5; log y; — 537%. Money 
demand equations of this form have previously been estimated for Brazil by Silveira 
(1973) and Khan (1977). We assume an adjustment mechanism of the form (log m, — 
log m:-1) = h(log ma — log m,-,), 0 <h < 1. See Griliches (1967) for a discussion of the 
conditions under which adjustment mechanisms of this class are likely to be optimal. 

'* As can be seen, we are now taking into account changes in the money multiplier. 
Note also that we consider only three terms in the exchange-rate equation (this issue is 
discussed further below). 

"See Hendry (1971). The parameter estimates reported below were obtained by 
using Wymer’s (1978) computer program, REsIMUL. The program uses a Newton- 
Raphson iterative procedure, beginning with arbitrarily given initial values of the 
parameters. 
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assumed that the relevant information set includes lagged values of 
the variables which, according to the model, determine the rate of 
inflation.?° 


A Crawling-Peg Experience: The Case of Brazil 


Brazil’s adoption of the crawling-peg regime can be considered as an 
extension of indexation to the external sector. Starting in 1964, Brazil 
engaged in one of the most comprehensive indexation programs 
anywhere. The financial sector was the first to be indexed. Exchange- 
rate indexation was introduced in August 1968.71 

During the postwar period, Brazil had experienced a variety of 
exchange-rate regimes. Until 1953, exchange controls and licensing 
were widely used. In 1953 an exchange auction system with multiple 
exchange rates was introduced. A process of unifying the rates was 
initiated in 1961, and since then the country has had a pegged ex- 
change rate. Since 1968, when the crawling peg was adopted, new 
selling and buying rates for the cruzeiro in relation to the U.S. dollar 
are announced by the central bank at frequent intervals (on average, 
the rate was changed over 10 times a year). 

Despite the fact that there are no explicitly stated rules for the 
crawl, it is evident that the method in effect applies a purchasing 
power parity clause to the domestic price of foreign exchange, and 
the evolution of both domestic and foreign price levels is taken into 
account in adjusting the exchange rate. In this way, as emphasized by 
the Brazilian authorities, it is possible to neutralize the harmful effects 
of domestic inflation on the competitiveness of Brazilian products in 
international markets and, hence, on Brazil’s balance of payments.” 

Although inflation differentials are certainly an important factor in 
determining the crawl, it was not the only one. However, it is evident 
from the data (see Lemgruber 1977, tables 1 and 6; and Moura da 
Silva 1977, table 2) that since 1968 the variations in the exchange rate 
have followed purchasing power parity rates closely, suggesting that 
exchange-rate changes have been mainly determined by the differ- 


20 Specifically, 7* was constructed from the fitted values of a regression of the rate of 
inflation on lagged values of itself, foreign inflation, domestic-credit growth, 
exchange-rate devaluations, and real income growth. For each of these variables two 
lagged values were included in the equation. 

21 On the various ‘aspects of indexation in Brazil, see Nadiri and Pastore (1977). A 
comprehensive study of the recent Brazilian economic experience is presented by 
Lemgruber (1977). On the functioning of the crawling peg and its effects on the 
economy, see Moura da Silva (1977). See also Lara-Resende (1978). 

# This appears to be the main justification put forward by the authorities for the 
adoption of the crawling peg; on this issue see Lemgruber (1977) and Moura da Silva 


(US 72). 
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ence between domestic and foreign inflation. The specification of 
the exchange-rate reaction function above (eq. [13]) 1s intended pre- 
cisely to capture this feedback process. Such a specification imposes 
restrictions on the data; econometric tests of these restrictions as well 
as the results from estimating the model are reported below. 


Empirical Results 


We now report the estimates of the parameters of equations (3') and 
(11')-(13'), as well as of the derived reduced-form equations. All 
estimates are based on quarterly data for Brazil. The sample period, 
11/1968 through IV/1977, contains 38 observations. Most of the 
variables are represented by readily available series.“ The results are 
presented in table 1. All the parameter estimates have the proper sign 
and are reasonably well determined; in most cases asymptotic ¢ ratios 
are well above 2.?° 

All the coefficients in the estimated money-demand equation (3’) 
have the expected signs and are significantly different from zero. The 
estimated coefficients are fairly similar to those reported for Brazil by 
Silveira (1973) and Khan (1977). Our parameter estimates imply a 
long-run income elasticity of real money demand of 1.307 and a 
long-run elasticity or real money demand with respect to expected 
inflation of —0.77. 

The results for (11’)—(13’) indicate that both foreign inflation and 
excess supply in the money market significantly affect the determina- 
tion of the domestic rate of inflation. These two factors, world 
inflation and monetary imbalance, also significantly affect the balance 
of payments, as shown by equation (12'). Specifically, an acceleration 
in the rate of domestic-credit creation over and above the rate of 
growth in money demand will increase domestic inflation and reduce 
international reserves. The overall effect of such an increase, how- 
ever, cannot be established solely from equations (11") and (12). This 


3 The exchange rule has been followed with a considerable degree of consistency. 
Almonacid and Pastore (1977) present some evidence to that effect and criticize the lack 
of flexibility—the rule has not been altered to deal with the oil crisis and with the 
mounting external debt of Brazil. Similar evidence is also presented by Connolly and da 
Silveira (1979, pt. 3), who found that purchasing power parity holds very well, particu- 
larly after 1962. 

4 The following series were used for P, R, and M: consumer price index, interna- 
tional reserves of the central bank, and the money supply; D was generated by sub- 
tracting R from the monetary base; p is the market exchange rate, selling rate (average 
over the quarter) (all the above were taken from International Financial Statistics, various 
issues); y is represented by the quarterly real income series constructed by Wachter 
(1976). (The series were extended through 1977 using Wachter’s method.) 

> Asymptotic ¢ ratios are defined as coefficient values divided by asymptotic standard 
errors. 
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TABLE 1 


FULL-INFORMATION MAXIMUM-LIKELIHOOD ESTIMATES OF THE MODEL 








(Si) log m; = 0.162 log y, — 1.700 wf + 0.876 log my-1, 
(0.049) (0.229) (0.045) 
MSE = 0.0018, @, = —0.613, C, = 0.873; 
(0.042) (0.297) 
(LE) P¥ = 0.958(P* + p*), + 0.042(a* + yD* — mi), 
(0.018) (0.018) 
MSE = 0.0003, 6, = —0.082, C, = 0.002; 
(0.110) (0.0012) 


(12’) (i= y)Ri = 0.958(P* + pi me — yD* — a*),, 


(0.018) 
MSE = 0.0021, #3 = 0.251, C; = —0.038; 
(0.045) (0.039) 
(13’) — p* = 0.526(P* — P#), + 0.249(P* — Px),_, + 0.118(P* — P*), », 
(0.106) (0.005) (0.029) 
MSE = 0.0005, 6; = —0.050, C, = —0.008; 
(0.106) (0.003) 
0.0018 
r= | —0.0004 0.0006 
0.0009 0.0004 0.0029 
0.0002. —0.0005 —0.0005 0.0005 





Nore.—See text for notation. Numerals in parentheses are asymptotic standard errors of regression coefficients; 
T is the estimated variance-covariance matrix of residuals; C,; ( = 1, 2, 3, 4) is the estimated constant term; @; is the 
autoregression coefficient; and MSE is the mean square error. 


is so because the divergence of domestic from world inflation will, as 
indicated by equation (13’), depreciate the exchange, which in turn 
will further increase the rate of inflation and reduce the rate at which 
foreign reserves are depleted. In order to analyze the full effect of 
changes in the rate of growth of domestic credit (as well as in the other 
exogenous variables), reduced-form coefficients should be con- 
sidered. These coefficients are reported below. 

The parameter estimates in equation (13’)—the exchange-rate 
reaction function—indicate that more than 50 percent of the 
foreign-domestic inflation differential is transmitted to the exchange 
rate within the current quarter, and an additional 25 percent in the 
next. After only three quarters the full differential is completely 
reflected by the, exchange-rate adjustment. In this context, it is im- 
portant to note that equation (13) embodies a restriction regarding 
the effects of current and lagged values of the domestic-foreign 
inflation differential on the exchange rate. Specifically, a pattern of 
geometrically decaying weights was postulated. The results reported 
in table 1 are derived under the imposition of this restriction. To test 
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TABLE 2 


DERIVED REDUCED-FORM ESTIMATES 








(14) p¥ = (0.045 + 0.43L + 0.41L? + 0.039L3 + 0.037L4)(yD* + a* — mg — Pi), 
(0.018) (0.017) (0.015) (0.014) (0.012) 


(15') P# = (0.915 — 0.041L — 0.039L? — 0.037L* — 0.036L*)(Pit)¢ 
(0.030) (0.016) (0.014) (0.013) (0.012) 
+ (0.085 + 0.041L + 0.039L? + 0.037L3 + 0.036L4)(yD* + a* — mz)s, 
(0.030) (0.016) (0.014) (0.013) (0.012) 
(16’) (1 — y)R# = (—0.915 + 0.041L + 0.039L? + 0.037L? + 0.036L4)(yD* + a* 
(0.030) (0.016) (0.014) (0.013) (0.012) 


* 
at?) ieecile) oe 





Note.—For notation and explanation, see text. 


its empirical validity, we have estimated the system unrestricted, anda 
x? likelihood ratio test was constructed, giving a value of 2.114 with 2 
df. This is less than the 95 percent x? critical value; that is, the 
sample information fails to reject the restrictions embodied in (13’) 
at the usual 5 percent significance level. 

With the information contained in table 1 it is possible to calculate 
the derived reduced-form estimates and their asymptotic standard 
errors. The model’s reduced forms are given by equations (14)—(16), 
whose estimation forms include changes in the money multiplier, a*. 
These estimates, as implied by the structural estimates of table 1, are 
presented in table 2 ({14']-[16’]), from which the overall effects of 
current and lagged rates of excess money supply and of foreign 
inflation on the exchange rate, the domestic rate of inflation, and the 
balance of payments can be assessed.2® The coefficients indicate that 
about 25 percent of the excess flow supply of money (the difference 
between the rates of growth of domestic credit and money demand) is 
transmitted to the inflation rate within five quarters. To the extent 
that the rate at which the excess supply of money is created exceeds 
the foreign rate of inflation, the exchange rate will depreciate (by 
about 15 percent of the difference, during the first five quarters) and 
international reserves will fall. Monetary equilibrium is therefore 
maintained by a combination of reductions in the real value of the 
nominal stock (due to the acceleration in the inflation rate) and 
reductions in the monetary base (through the loss of reserves). 

The effects of foreign inflation on the three endogenous variables 
can also be analyzed from the estimates of table 2. A higher rate of 
world inflation will reduce the rate of exchange depreciation and will 


6 Although only current and four lagged coefficients are reported, the coefficients 
for additional lags can be calculated from the information provided in table 1. 
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have a positive effect on the rate of accumulation of foreign reserves. 
It will, however, strongly and significantly raise the domestic rate of 
inflation during the current quarter. Note that lagged rates of foreign 
inflation appear with a minus sign, which is indeed consistent with our 
version of the crawling-peg regime. An acceleration of world inflation 
will, ceteris paribus, induce subsequent appreciations of the exchange 
rate, thus moderating the initial increase in both the domestic price of 
traded goods and the domestic rate of inflation. 


II. Concluding Remarks 


The purpose of this study has been to construct and test a model that 
accounts for the joint determination of the exchange rate, the rate of 
inflation, and the balance of payments in a crawling-peg economy. 
The model presented extends the previous literature on the monetary 
approach to the case of a crawling-peg regime. 

According to the model, a small open economy that indexes its 
exchange rate through the adoption of a purchasing power parity 
clause may choose its rate of inflation independently from the rest of 
the world. To the extent that purchasing power disparities arise 
because full adjustment of the exchange rate is not instantaneous, 
equilibrating flows of international reserves will take place. Thus the 
model is capable of generating a pattern of short-run deviations from 
purchasing power parity which occur simultaneously with movements 
in international reserves and the exchange rate. 

Our theoretical framework can be used to analyze the effects of 
domestic monetary policy and of external inflation on a crawling-peg 
economy. For example, we have shown that an increase in the rate of 
domestic-credit creation will generally tend to raise domestic inflation, 
reduce international reserves, and depreciate the exchange. The 
exact path of adjustment will depend on a number of parameters 
explicitly incorporated into the model. The faster the adjustment of 
the exchange rate to purchasing power disparities, for example, the 
greater the impact effects of domestic monetary variables on the rate 
of inflation and the smaller their impact effect on the balance of 
payments. Similarly, domestic monetary variables will have a greater 
impact effect on domestic inflation, and a smaller impact effect on the 
balance of payments, the higher the share of nontraded goods in 
expenditures and the higher the elasticity of the traded/nontraded 
relative price with respect to these monetary variables. 

The predictions of the model regarding signs and magnitudes of 
the different parameters appear to be sustained by the Brazilian data. 
Overall, the empirical evidence presented seems to indicate that our 
theoretical framework is consistent with the sample information. 
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Appendix 


In this Appendix we derive the structural and reduced-form equations of the 
model when persistence effects are allowed in the price equation for non- 
traded goods. The structural equations using this specification are then esti- 
mated and the results are reported below. 

Replacing equation (8) by equation (9), we can express the rate of change of 
nontraded goods prices as 





(Pir) = (P#), + QD. (Al) 


0 
= ail 
Adopting now the definition of Q given in Section I, we obtain equation (A2), 
the counterpart of equation (10), 

0 
ics 


Using equations (7) and (A2), the rate of domestic inflation, equation (6), can 
be written as 





(P¥r)t = (P#), + (yD* el oie Mi)t- (A2) 


P¥ = €'(P& + p), + (1 — €')(yD* + a* — ma), (A3) 
where e’ is now a polynomial in the lag operator, 
p fee ee ees le (A4) 


1—oal + 6(1— A). 


Postulating the same crawling rule as used in the text, that is, equation (13), 
we can obtain the reduced forms of the model,”” 


ees ier uae ee 
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(1 — y)RF = eed a (P& + mi — yD*),. (A7) 


The econometric forms of the model, in conjunction with the demand for 
money, equation (3’), are estimated by the procedure described in Section IT. 
The parameter estimates do not differ much from the model in which this 
type of persistence is ignored. 

For comparison and completeness we now report the estimates obtained 
with the extended model: 


27 The reduced forms could be expressed in terms of the underlying parameters, A, 
a, and @ by substituting the RHS of (A4) for e’. This results in rather cumbersome 
expressions, as, e.g., in eq. (A5): 





ss BO — d) & arene 
P2 esata Aad ec ere 


CRAWLING-PEG SYSTEM 149 


log m, = 0.224 log y, — 2.2247 + 0.828 log m;_,, (A3') 
(0.063) (0.300) (0.058) 
pF = 0.618(P* — P*), + 0.236(P* — P*),_, + 0.090(P* — P*),_,, (A8) 
(0.099) (0.023) (0.032) 
P* = 0.951(P* + p*), + 0.049(a* + yD* — m2), (A9) 
(0.018) (0.018) 
(1 — y)R#F = 0.951(P* + p* + mz — yD* — a*),. (A10) 
(0.018) 
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Does Federalism Matter? 
Political Choice in a Federal Republic 





Susan Rose-Ackerman 


Yale University 


This paper builds upon some well-known facts about state govern- 
ment to generate new conclusions about social choice on the national 
level of a federal republic. Citizens vote against national laws that 
restrict their state’s ability to export costs but support laws that 
reduce the costs imposed on them. Individuals may seek to extend 
the laws passed in some states to the entire nation or may oppose 
preemptive laws because they benefit from variety. Since these moti- 
vations are absent in a unitary system, national support for a law will 
depend upon whether a unitary or a federal structure prevails. 


I. Introduction 


This paper builds upon some well-known facts about state govern- 
ment to generate new conclusions about social choice on the national 
level of a federal republic. The central feature of state government 
behavior I shall exploit is the incentive each state has to improve its 
own position by imposing costs on the residents of other states. This 
search for local advantage takes many forms. On the one hand, states 
may impose taxes and regulations which are borne by out-of-state 
residents; on the other hand, they may try to attract investment from 
other states by providing tax breaks and special public services." 


This paper was presented at the Workshop on Analytical Urban Economics at 
Queen’s University, Kingston, June 1978. I wish to thank Bruce Ackerman, the work- 
shop participants, and the referee of this Jowrnal for helpful comments. 

1 These incentives are stressed by Walker (1969, p. 890) and Posner (1977, chap. 26). 
McLure (1967) estimates that about one-quarter of all state taxes are exported. Exter- 
nalities such as air and water pollution are emphasized by Breton (1965), Olson (1969), 
Tullock (1969), Rothenberg (1970), and Oates (1972). These authors discuss how a 
federal system can trade off interjurisdictional variety against the internalization of 
externalities through boundary definition, the assignment of functions to levels of 
government, and intergovernmental grants. 


[Journal of Political Economy, 1981, vol. 89, no. 1] 
© 1981 by The University of Chicago. 0022-3808/81/8901-0008$01.50 
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Whatever their particular character, these interstate spill-ins and 
spill-outs can alter the substance of national legislation. Citizens will 
vote against laws that restrict their state’s ability to export costs but will 
support laws that reduce the costs imposed on them by their own and 
other states’ choices. Since these strategic motivations are absent in a 
unitary system, national support for particular laws will depend upon 
whether a unitary or a federal structure prevails. 

Despite its potential importance, the impact of federalism on the 
strategic position of voters has been largely ignored by social scien- 
tists. In his excellent review of the literature, for example, William 
Riker (1975) argues that federalism has no important influence on 
substantive national outcomes, suggesting that it does no more than 
delay the passage of national legislation opposed by a substantial 
minority of state legislatures.’ Although Riker’s conclusion is partly an 
empirical proposition, it is also based on an implicit model of how 
government systems behave. Riker argues (1975, pp. 155—56) that the 
underlying distribution of tastes in the population determines politi- 
cal outcomes. Since federalism per se has no impact on tastes, it will 
not, therefore, affect national political choices in the long run. The 
present paper challenges these theoretical underpinnings. 

Since I am concerned not with an exhaustive taxonomy but with the 
relationship between political power and political structure, I con- 
centrate on simple models. The system I discuss has only two 
“layers” —national and state.* States have fixed boundaries and cover 
the nation’s entire geographical area so that every citizen lives in only 
one state. I will contrast the legislative choices of a unitary govern- 
ment with those of a “hierarchical” federal system where higher-level 
governments can always preempt the legislative choices of lower-level 
governments. If the superior government has taken no affirmative 
action, however, the statutes of inferior governments are binding.’ I 


* Riker (1975) proposes an imaginary experiment in which matched pairs of unitary 
and federal systems are examined to discover if there are important public policy 
differences. He hypothesizes that public policy within each pair will be “remarkably 
similar regardless of federalism” (p. 144). Furthermore, he argues that in the United 
States federalism delayed national regulation of business (p. 154) and helped per- 
petuate racist acts (pp. 154—56) but had no long-run impact on the character of national 
legislation. 

* For an attempt to justify a three-layered system, see Ylvisaker (1959). With the 
exception of Wechsler (1963), Posner (1977), and Winter (1977), legal commentators 
have all but ignored the issues discussed in this paper and have instead concentrated on 
the relationship between state and federal courts. For a recent article in this tradition, 
see Cover and Aleinikoff (1977). 

4 This model should be contrasted with others which have a strict division of func- 
tions between high- and low-level governments. For example, see Wheare (1953, pp. 
32-33). Recently, concern for a strict division of authority has given way to scholarship 
which recognizes the importance of interactions between levels of government. See, for 
example, Grodzins (1960, 1966) and Elazar (1962). 
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assume that both state and national governments are direct democ- 
racies where the only political actors are individuals and where all 
decisions are made by majority vote. Political issues are separable and 
well defined, and they appear to voters as simple dichotomous choices 
between maintenance of the status quo and a change.” The analysis 
concentrates on legislative choices. I do not discuss administrative or 
judicial issues or the possibility that the national government will seek 
to administer programs through state government agencies. 

Section II shows that federalism “matters” in a political system 
where capital can move but people cannot. Section III considers the 
impact of permitting interstate migration of voters, and Section IV 
shows that federalism can matter even in the long run when states 
respond to the legislative choices of other states. 


II. Federalism Does Matter 


The major difference between a unitary system and a hierarchical 
federal structure is the characterization of the status quo. Let y = the 
status quo in a unitary system, and let x = a proposed, exogenously 
defined, legislative change. Suppose, for example, that in the status 
quo casino gambling is illegal. The federal government, however, 
proposes to make such gambling legal and to levy a tax on the 
profits. 

In a hierarchical federal system, some states have passed legislation 
that is similar to x. Others have a status quo that is similar to y. Let us 
call the former x-type states and the latter y-type states. Continuing 
the example, x-type states permit casino gambling and levy a tax on 
earnings, while casino gambling is illegal in y-type states. Assume that 
if all states are x-type states, then the state laws taken together would 
have the same impact as x. Similarly, assume that if no states have 
passed the x-type law, the status quo is the same as in a unitary system. 
Thus, in my example the state gambling laws are duplicates of x and y, 
respectively. This assumption is not essential to the analysis, but it 
simplifies the exposition considerably. 

Because of interstate spillovers of costs and benefits, individuals’ 
preferences over x and y will not necessarily be the same as their 
preferences for alternative state legal regimes. In the gambling 
example, some people may favor a statewide gambling law and vote 


» These assumptions permit me to look at the vote on a single issue and to avoid voting 
cycles in which majority rule does not produce a determinant outcome (Sen 1970, p. 
38). The actual relationship between state statutes and federal law in the same area is, of 
course, considerably more complex. In reality, the relationship is seldom obvious and 
federal courts are frequently called upon to sort out the overlapping authority of state 
and federal statutes (see Tribe 1978, pp. 378-86). 
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against x at the national level. A state’s residents gain when out-of- 
staters come to gamble and consume tourist services. The state would 
lose this advantage if gambling were legal in every state. In a unitary 
system, however, these same individuals might favor x over y since 
they would have no special advantage to protect. 

A voter’s preference for x over a system with a variety of state laws 
will often depend upon the number and location of x-type states. The 
economic advantages of permitting gambling in one’s state of resi- 
dence are larger the smaller the number of other states which also 
permit gambling. The benefits are also higher the more geograph- 
ically distant are the other jurisdictions with legalized gambling. 
Let g; = z; if state 7 has passed an x-type law, g; = w; if state i is a y-type 
state, and let H = total number of states. Then, the status quo ina 
federal system is a vector G = (1,--- Zi, --+,8n) where g; = z; or uj. 
Thus we can summarize the situation of those living in x-type states as 
z(G) and the situation of those in y-type states as w(G). 

In this portion of the analysis, I assume that migration across state 
lines is impossible, so that the only way individuals can affect the legal 
regime under which they live is to vote for new state or federal laws. 
Capital, however, is free to move between jurisdictions. Given these 
assumptions, we can now compare the way voters with different 
preferences will cast their ballots in federal and unitary republics. 
Turning first to a unitary system, individual preferences can be 
straightforwardly translated in a social choice. Law x passes if and only 
if: 


N(xRy) > N(yRx), (1) 


where R is the binary relation of “weak preference” (“at least as good 
as”). Then N(aRb) is the number of people for whom aR,b where R; is 
the weak preference relation for individual j and R; is reflexive, 
transitive, and complete.® 
In a hierarchical federal system, under the assumption that only the 
current level of G is relevant, x passes at the national level if and only 
if: 
N,[xRz(G)] + No[xRw(G)] = N,[z(G)Rx] + N[w(G)Rx], (2) 


where N,(aRb) is the number of people living in x-type states for 
whom akb and N,(aRb) is the number of people living in y-type states 
for whom ak,b. 

Expressions (1) and (2) need not yield the same results. Individuals 
who would prefer x to y need not also prefer x to z(G) or w(G). If a 


® In Sen’s terminology (1970, p. 9), R; is an ordering. The notation in (1) follows ‘Sen 
(970 spawdl): 
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to x types) to y types) 





state is able to export the costs of a program, then its citizens are likely 
to oppose preemptive federal laws even though they favor x-type 
legislation in their own state and would favor x over y in a unitary 
system. Similarly, if citizens can benefit from the legislative initiatives 
of other states then they may vote against x although they have xRjy 
with a unitary government. The gambling example falls in the first 
category. Public health or pollution control laws in neighboring states 
may produce the second voting pattern. In fact, individual prefer- 
ences may produce any of the possible rankings ofx,y,andz(G) andx, 
y, and w(G); and one can thus divide the population into groups 
depending upon their preferences and the states in which they live. 
For a givenG, let A(G) = set of people in x-type states and let B(G) = 
everyone else. 

People living in x-type states have xRz(G) or z(G)Rx or both. Those 
in set B(G) have xRw(G) or w(G)Rx or both. In a unitary system voters 
have xRy or yRx or both. Table 1 is a matrix of the possible taste 
combinations. The entries in the matrix are the sets of people in each 
taste class. Thus 4(G) = 4,(G) U A,(G) U A3(G) U Ag(G) and B(G) = 
B,(G) UB,(G) U B;(G) U B,(G)." In expression (1), N(xRy) = N(A, U 
A; U B, U B3). In expression (2), Ni[xRz(G)] = N,(A; U Az) and 
N.[xRw(G)] = N2(B, U B,). Thus given G, the vote for x in a unitary 
system, N(xRy), is greater than, equal to, or less than the vote in a 
hierarchical system, N,[xRz(G)] + No[xRw(G)], as: 


N(d43 U Bs) 2 N(Ap U Bi). (3) 


Expression (3) shows that federalism “matters.” The size of the vote 
in favor of the national law may be higher or lower in a hierarchical 
federal system than in a unitary system. A law which passes in one 
system may be defeated in another. The practical importance of this 
result, however, depends upon whether sets 43, B; and A», By repre- 
sent common preference configurations. Sets 43 and B; include indi- 


“The symbol U stands for the union of sets, i.e., the set of people belonging to either 
set. 
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viduals who prefer x to y as the national law in a unitary system but 
who each favor conditions in their own states over x.8 Sets A, and B, 
are the reverse of A; and B3. They include individuals who would 
Oppose x in a unitary system but favor preemptive legislation because 
it keeps them from losing at the expense of other citizens. 

If sets 4, and By are large relative to sets 4; and B;, thena paradoxi- 
cal result is possible. Although federalism is often justified as a way to 
preserve diversity and prevent the centralization of power, (3) implies 
that the existence of low-level governments may cause some voters to 
favor central control of an activity that they would otherwise have 
wanted unregulated. Without constitutional limits, a majority of citi- 
zens may be so unhappy with the mixture of independent state 
choices under federalism that they favor a more powerful central 
government than they would in a unitary system. 


III. Migration and the Vote for National Laws 


When voters can move between jurisdictions in response to public 
policy changes, the vote for national preemptive laws will change. If 
migration is costless and if legal regimes are the only determinant of 
location, then, given G, national electoral support would fall in a 
hierarchical federal system. Voters are less likely to want national 
uniformity when they can move to congenial states. 

Reality, of course, is somewhere between the extremes of a per- 
fectly fixed population and a perfectly mobile one. Furthermore, 
different types of people have different moving costs, and the relation 
between these costs will help determine the political power of various 
groups at the state level. Thus, mobile groups may impose costs on 
immobile groups as a price for not leaving the state. Alternatively, 
undesirable groups may be induced to migrate by taxing or regulating 
them or by subsidizing their moving expenses. Variations 1n the cost 
of migration change the strategic environment. Those who can move 
cheaply have an advantage over those who cannot. Therefore, na- 
tional decisions which require uniformity will be strongly supported 
by those who lose from the migration of others and opposed by those 
whose mobility permits them to gain at the expense of less mobile 
voters. 

The possibility of migration has implications for national programs 
which seek to redistribute income from high-income to low-income 
families. Economic analyses of multiple government systems (e.g., 
Olson 1969; Oates 1972) conclude that interjurisdictional mobility 


8 ft also, of course, includes those who are indifferent between any of these alterna- 
tives. Only people who are indifferent to all three alternatives are in both A, and A; or 
in both B, and Bs. 
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makes serious redistribution impossible at the state level. Progressive 
tax and spending policies must be carried out by preemptive laws at 
the national level. Interstate migration, however, also lowers the 
prospects for passage of national redistributive laws. If voters are 
mobile, support for a national preemptive law which redistributes 
income from the rich to the poor may be less than the support such 
legislation would obtain in a unitary system or in a federal system with 
no migration. Many who would support a particular progressive tax 
and spending program in a unitary system may oppose it in hierarchi- 
cal federalism if the law preempts state “beggar-my-neighbor” laws 
which benefit these citizens. Selfishness will dominate altruism if the 
opportunity cost of altruism is too high. 


IV. Does Federalism Matter in the Long Run? 


The result in (3) applies when the number of x-type and y-type states 
is held constant (i.e., for a given G). One student of federalism (Riker 
1975), however, argues that in the long run a federal system and a 
unitary one will converge to the same pattern of national laws. The 
view that federalism does not matter in the long run is based either on 
a hypothesis about how tastes change over time® or on a model that 
makes particular assumptions about the way votes change in response 
to new information and new circumstances (i.e., changes in G). The 
cases where federalism does not matter are contingent upon these 
assumptions. This section shows that it is possible to impose other 
plausible conditions on tastes, opportunities, and government struc- 
ture and produce an equilibrium where x- and y-type states coexist in 
spite of majority support for x over y. 

My analysis of state legislative choices challenges the idea that states 
are “lighthouses” that show the way to the federal government by 
enacting innovative laws. In my model, a law may spread to many 
states, not because it has been “tested” and found useful,’ but rather 


* Riker’s argument turns on the overriding importance of individual tastes in deter- 
mining political outcomes. He uses the example of white racism. “As long as whites 
strongly prefer racist institutions, one can expect institutions to be racist regardless of 
whether the country is federal or unitary. But when the preference for racist institu- 
tions weakens, then federalism helps racism by rendering difficult the enforcement of 
an anti-racist policy on the minority of white racists. So we can say that the beneficiaries 
of federalism get only marginal benefits on policy, but marginal or not, they are 
undoubtedly real” (1975, p. 156). 

See Walker (1969, 1971, 1973). By his choice of words, Walker appears to believe 
that early adopters of laws are progressive. He calls them “pioneers” (1969, p. 881). A 
glance at Walker’s list of laws should warn anyone against this inference. Many involve 
the licensing of occupations such as barbers or real estate brokers. Others simply 
mandate the establishment of state agencies, many of which are required as a condition 
for receiving federal grants. For a different perspective on the same issue which 
emphasizes politicians’ incentives to take risks, see Rose-Ackerman (1980). 
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because voters in y-type states want to avoid damage at the expense of 
others. Similarly, a national law may eventually pass in a federal 
system, not because a new initiative has been tested in the states, but 
rather because voters want to override the costs of spillovers and 
inconsistent laws. 

Since a fully general analysis would be difficult to interpret, I 
concentrate on three special cases that capture the essential features 
of many actual situations. In these examples, I assume that tastes do 
not change over time, that voters have similar initial information, and 
that benefits that spill over to people in one group of states are costs to 
people in the other. In case 1, voters in x-type states receive positive 
spill-ins from out-of-state residents, and voters in y-type states bear 
costs. In case 2, voters in x-type states receive negative spill-ins from 
y-type states, and voters in y-type states benefit at the others’ expense. 
Finally, in case 3, voters’ beliefs in “states’ rights” may conflict with 
their substantive position on other issues. 


Case I: Positive Spill-ins to x-Type States 


Examples of case | are casino gambling or state-run lotteries in states 
where voters are concerned only with tax revenues and jobs. Taxes 
imposed on products sold to out-of-state consumers also fall into case 
1. In these situations, A, and B; are empty sets.'! Thus (3) becomes 
N(A;) 2 N(B2). 

Since an x-type law leads to positive spill-ins, it is plausible to 
suppose that over time more and more states pass x-type laws (i.e., the 
set G changes). A few states institute lotteries, for example, and 
eventually many more follow their example. If this happens, people 
in set B now join set A. Many who favored a national lottery (i.e., those 
in B, and B,) may now oppose it as a way of preserving their newly 
acquired positive spillovers (i.e., they move to A; and A,, respectively). 

As more states pass x-type laws, however, the level of net positive 
spillovers to each x-type state falls. In the case of casino gambling and 
lotteries, a state’s consumers are less likely to gamble in other states if 
their own state has an x-type law.’” In addition, negative spillovers to 
y-type states increase as more and more states export costs to them. 
The decline in net benefits and the increase in net costs as G changes 


1! Voters in A, favor preemption but oppose x in a unitary system. Voters in B, 
oppose preemption but favor policy x in a unitary system. Both of these preference 
patterns seem to be implausible if x-type states benefit at the expense of y-type states. 
This case would not hold if some people think that gambling brings costs (i.e., crowds, 
corruption). 

2 Similarly, tax incentives to encourage industry to locate in a jurisdiction provide 
few benefits to taxpayers if most states provide equivalent incentives. For an attempt to 
show the costs of interstate competition for business investment, see Jacobs (1979). 
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will raise the vote for a national preemptive law in individual x- and 
y-type states. Thus, a federal structure generates two influences that 
operate in opposite directions. A person whose state institutes a lot- 
tery is more likely to oppose federal preemption. However, as the 
number of states with lotteries increases, the benefits of multiplicity 
fall. If the first factor dominates the second, federalism will continue 
to matter even in the long run. 


Case 2: Positive Spillovers to y-Type States 


Examples of case 2 are minimum wage laws" or fair labor statutes that 
increase job opportunities in states without such laws and state pollu- 
tion control laws where water and air cross state lines. Since in case 2 
federalism permits y-type states to gain at the expense of x-type states, 
A; and B, are implausible preference patterns. Thus (3) becomes 
N(B3) 2 N(A}). 

_The majority of voters in x-type states have decided that, in spite of 
the costs, the law is worth having. Suppose that as time passes, voters 
in y-type states learn from the experience of those who have x-type 
(e.g., minimum wage) laws. If the benefits are greater than expected, 
then eventually more states pass minimum wage laws. People whose 
states enact such laws are now more likely to favor federal preemp- 
tion. They no longer obtain the benefits of spillovers from other 
states, and they might like to prevent other states from benefiting at 
their expense. However, as the number of x-type states increases (G 
changes), the costs imposed on each x-type state fall and the benefits to 
each y-type state rise. This shift in costs and benefits should reduce 
support for a preemptive law in individual x- and y-type states. If the 
second (change in G) effect outweighs the first (change from B to A) 
effect, a national minimum wage law may not pass in a federal system 
even though many states have passed their own laws and N(xRy) > 
N(yRx). Historical research, however, has stressed the cases where 
state initiatives led eventually to a federal statute, leaving un- 
analyzed the many situations where the adoption of a law by many 
states has not been followed by a preemptive federal initiative." 


'® Riker (1964, p. 146) uses the example of minimum wage laws that differ across 
states: “There is then much likelihood of capital flow from the high-wage localities to 
the low-wage localities for all those industries in which labor represents a high propor- 
tion of the cost. Aside from the imposition of a nationally uniform minimum wage, the 
only way that high-wage localities may counter this capital flow is by reducing the 
minimum wage level.” 

4 See Fine (1956, pp. 353-85), who discusses the examples of antitrust laws, pure 
food and drug laws, protective labor legislation, social security, and welfare. 

* Many of the laws studied by Walker (1969) have spread across many states but have 
never been adopted nationally. It is possible that some of these statutes might have 
majority support in a unitary system. 
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In this case, some people put their belief in states’ rights above their 
position on x (e.g., the abolition of slavery) and are in A; and By. Thus, 
no one favors imposing x on all states if they also oppose x in a unitary 
system (i.e., 4, and B, are empty sets). Therefore, (3) becomes N(A; 
iB sO. 

The vote for x in a federal system is certain to be less than the vote 
in a unitary system, and federalism may continue to matter so long as 
preferences remain constant.'® Assume, however, that people with a 
substantive preference for x over y favor states’ rights only if the 
results of independent state choices are not too far away from their 
position on x. Then, it is easy to tell a simple story where an unantici- 
pated change leads to convergence of a federal and a unitary system. If 
the number of y-type states (e.g., slave states) increases exogenously, 
people who are opposed to slavery may now choose federal preemp- 
tion over states’ rights (i.e., some of those in 4; and B; move to 4, and 
B,). Therefore, x may now pass at the national level. An exogenous 
change that seemed to favor y leads in fact to its repudiation by the 
nation. Thus, it may not be necessary to assume a growth in Northern 
antislavery sentiment to explain the evolution of pre—Civil War poli- 
tics. As slavery spread across the South after the invention of the 
cotton gin, even people whose antipathy to slavery was constant might 
have supported a national policy of abolition if they thought that 
more and more states would permit slavery (i.e., they might shift from 


- 16 For symmetry, we could include a fourth case that appears to be of less empirical 
importance. In this final case, people believe in uniformity or in a strong central 
government and are willing to support a national law even when they would oppose the 
law in a unitary system. Thus, 4; and B; are empty sets, (3) becomes 0 < N (dy U B,), and 
the vote for x is always larger in a federal system than in a unitary system. If is a status 
quo which permits the private market to operate with a minimum of government 
interference, then case 4 is illustrated by business managers who favor laissez-faire 
(yRx) but if faced with a mixture of differing state laws would rather have a uniform 
federal regulatory statute because it permits them to reduce costs. Employers who sell 
their products in many states often fit this preference pattern. They would rather not 
be regulated at all, but being regulated in some states is worse than a uniform national 
standard. For example, sellers of bottled mineral water are beginning to face a variety 
of labeling laws enacted by different states. According to Business Week: “Most bottlers 
say they would not oppose reasonable, uniform regulations. . . . Industry executives 
shudder at the prospect of trying to comply with rules that could vary widely among 
states” (“Mineral Water Could Drown in Regulation,” Business Week [June 11, 1979)). 
State trucking regulations are sometimes inconsistent and costly for truckers. Thus, 
state regulations of the mudguards required on interstate trucks were inconsistent, and 
an Illinois law was overturned by the Supreme Court as interfering with interstate 
commerce (Bibb v. Navajo Freight Lines, Inc., 359 U.S. 520 [1959], discussed in ‘Tribe 
[1978, p. 339]). Similarly, the Supreme Court ruled that Wisconsin’s prohibition against 
twin trailers on interstate highways was an unconstitutional interference with interstate 
commerce (“Breaking a Bottleneck in Long-Haul Trucking,” Business Week [March 7, 


1978)). 
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A; to A,). When slavery looked as if it might expand into the western 
states, people who had favored a federal solution might begin to 
support abolition.” 


V. Conclusions 


Political economists have generally recognized that realistic political 
systems must include some interjurisdictional spillovers as a cost of 
providing citizens with a choice of public service levels. Intergov- 
ernmental grants may reduce these interjurisdictional costs (Breton 
1965) but will not eliminate them. Because both the dispersion of 
tastes across the population and the level of externalities differ for 
different public services, some analysts have argued that a federal 
system is the best way to accommodate these conflicting tendencies 
(e.g., Oates 1972). In making this recommendation, however, these 
authors fail to make clear an important idealization inherent in their 
analysis. They assume that it is possible to assign functions unambigu- 
ously to levels of governments so that constitutional structure only has 
an impact on spillovers and on the position of minorities. In fact, it 
will often be impossible to assign responsibilities neatly to a particular 
political level. Indeed, this overlap is the characteristic feature of 
contemporary “cooperative” federalism. Given this fact, the present 
paper has shown that, when authority is divided, the choices of 
lower-level governments can have important consequences for the 
decisions of higher-level governments. Even when the central gov- 
ernment has the power to preempt state and local laws, its democratic 
choices will depend upon the strategic position of citizens living under 
alternative state legal regimes. The essential difference between a 
hierarchical federalism and a unitary system is the difference in the 
status quo. This difference affects the vote on national legislation and 
the bargaining power of individuals. Individuals may vote against the 
extension of a state law to the nation as a whole even though they 


‘’ Potter (1976), in a history of the period from 1848 to 1861, stresses the difficult 
trade-offs faced by many people. He writes that “the problem for Americans, who, in 
the age of Lincoln, wanted slaves to be free was not simply that Southerners wanted the 
opposite, but that they themselves cherished a conflicting value: they wanted the 
Constitution, which protected slavery to be honored, and the Union, which was a 
fellowship with slaveholders to be preserved” (pp. 44—45). The Northern public “placed 
their antislavery feelings in a context of state action, accepting personal responsibility 
for slavery within their own particular states” (p. 46). In contrasting the position of 
Stephen Douglas in 1854 with that of his opponents, Potter shows how these trade-offs 
were resolved in different people. “Douglas was a vigorous believer in the democratic 
principle of local autonomy, but his opponents were equally vigorous believers in the 
moral primacy of freedom. . . . Douglas cared more about the Union than about the 
eradication of slavery and would never push the slavery issue to a point where it 
imposed too much strain upon the Union. Many antislavery men thought the Union 
hardly worth preserving so long as it had slavery in it” (pp. 172-73). 
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would favor the law in a system with only one government. They may 
wish to extend a state law to all citizens although they would oppose 
the law in a unitary system. 

The analysis suggests a promising area for future empirical re- 
search. Congressional votes on particular issues might be associated 
with existing state laws,'* and the timing of federal passage of laws in 
several areas could be related to the patterns of state adoptions of 
similar laws. Empirical work is also needed on specific policy areas. 
For example, one might study the impact of local power over schools 
on state and federal education policy or see how federal efforts to 
reduce poverty have been conditioned by prior state and local efforts. 

My discussion of federalism also suggests that, at the point of 
constitutional choice, people are more likely to support a federal 
system with strong lower-level governments, the fewer the strategic 
possibilities open to individual states. Strong low-level governments 
will be attractive if people are grouped geographically by their taste 
for public services and if it is difficult to impose costs on other 
jurisdictions. This implies that many people would support constitu- 
tional constraints on state governments that limit the states’ strategic 
behavior while retaining many of the benefits of variety and ex- 
perimentation. I would also expect that, if interjurisdictional migra- 
tion is possible, then those who favor aid to currently disadvantaged 
groups at the expense of mobile capital and labor resources will favor 
a strong central government.’® It is a commonplace in economic 
analyses of federalism to note that low-level governments cannot 
carry out redistributive policies (e.g., Oates 1972). My point, however, 
goes beyond this conclusion to the observation that if states try to gain 
at the expense of other states, then interstate redistribution can occur 
that bears little relation to anyone’s notion of social justice and may 
end up making all households worse off. The end result depends 
upon each state’s strategic position, that is, its ability to export taxes 
and import benefits. 
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The demand distribution is employed as a novel and powerful basis for 
analyzing monopoly provision of price-excludable public goods (and 
could be used to analyze other market structures or, in some circum- 
stances, private goods provision). For (uniform) per-unit, all-or- 
none, two-part, and multipart pricing, we identify: characteristics of 
revenue functions, relative profitability, and operational procedures 
for selecting price-output levels. Under all these strategies, rationing 
of some consumers by output is required, and a positive price-output 
relationship may arise. In general the revenue ranking of pricing 
strategies is sensitive to the distribution of demand and, though 
multipart pricing can be expected to be revenue dominant, uniform 
per-unit pricing emerges as a surprisingly robust strategy even be- 
fore operational complexity is fully taken into account. 


Recent contributions to the economics literature reflect a growing 
interest in private provision of commodities which possess one 


Work on this paper was completed while both authors were in the Department of 
Economics, Monash University. The notion of the demand distribution, to our knowl- 
edge, was first suggested in earlier unpublished work of the first-mentioned author. We 
have benefited from comments of participants in a Monash workshop, a Virginia 
Polytechnic Institute seminar, and the 1978 Public Choice meetings. John Head, Yew- 
Kwang Ng, Geoffrey Brennan, and David Friedman provided useful observations, but 
our greatest debt is to William Oakland, whose extensive and perceptive comments 
have been of considerable assistance. The authors nonetheless accept the conventional 
attribution of errors to themselves. 
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characteristic of a public good (“jointness,” or “nonrivalness in con- 
sumption”) but not the other (“impossibility of exclusion”): Exclusion 
is assumed to be commercially feasible, so that anyone unwilling to 
pay the required price can be denied access to any or all of the units 
produced.t This important class of commodities can be termed 
price-excludable public goods or (reflecting a characteristic they retain 
rather than one they lose) joint goods. Television and radio transmis- 
sions and the services provided by transportation facilities (rivers, 
roads, airports, ships, buses, and airplanes), entertainment fa- 
cilities (theaters, galleries, museums), recreational facilities (national 
parks, football stadiums), and information serve as examples, of vary- 
ing degrees of purity:? All are important in modern societies, and 
virtually all have been suspected of being suboptimally provided by 
the private sector. 

Competitive models dominate the published literature, and Samu- 
elson’s (1954, 1955) seminal analysis of pure public goods has clearly 
influenced the questions asked and the methodology employed. 
Analysis of the behavior of and interaction between individual profit- 
seeking entrepreneurs has been neglected in favor of somewhat 
speculative predictions of the likely characteristics of market equilib- 
rium so that efficiency issues can be immediately examined. This is 
true, to some extent, even of the important contribution of Oakland 
(1974), in which it is suggested that competition will generate an 
equilibrium with unfamiliar characteristics in the joint goods context, 
with consumers facing an increasing step function of prices for ho- 
mogeneous consumption units. * 

In contrast, we present an analysis of joint goods provision with the 
individual profit-maximizing entrepreneur as its focal point. How- 
ever, we depart from the analogous private goods methodology in 
one important respect: The familiar notion of the aggregate demand 
curve is replaced by the demand distribution as a novel, powerful, and 
operationally meaningful building block, reflecting the fact that the 
number of consumers of particular output units plays a critical role in 
joint goods analysis even under simple pricing strategies. Using this 


1 See, in particular, Buchanan (1967), Thompson (1968), Demsetz (1970), Oakland 
(1974), Auster (1977), Lee (1977), and Brennan and Walsh (1979); and also working 
papers by Brennan and Walsh (1978) and Burns (1979). 

* Pure joint goods (like pure public goods) would permit an indefinitely large number 
of consumers. The examples cited (except perhaps information) fall well short of this 
but, as long as they remain uncongested, can still be analyzed as if they were “pure” (like 
“local” public goods). By treating costs as varying with intensity of consumption, our 
approach could also be applied to the impure,cases. 

3 We accept Oakland’s description of equilibrium, but his analysis of its attainment is 
essentially circular. The process of establishment of equilibrium must be substantially 
different from that in the private goods case. In fact, it may be that the process is 
analogous to the seriatim introduction of “new” goods. 
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basis, our central contribution is a general exposition of the neglected 
case of monopoly provision of joint goods, and for a variety of pricing 
strategies we are able to identify characteristics of revenue functions, 
relative profitability, and operational procedures for selecting price 
and output levels. Nonetheless, our framework can also provide a 
basis for analysis of competitive joint goods provision and for analyz- 
ing marketing strategies for private goods, particularly where constant 
marginal costs prevail. 

In Section I we introduce the demand distribution, provide a 
geometric interpretation of it, and set out basic assumptions and 
definitions. Sections II and III examine the characteristics of a range 
of feasible marketing strategies (applied uniformly across individuals) 
from per-unit pricing to sophisticated multipart pricing procedures, 
each of which has been discussed in the private goods literature and 
has a basis in common business practice.4 Section IV contains an 
illustrative analysis using a specific distributional form, while Section 
V brings together the main results with final observations. 


I. Framework of the Analysis 


For monopoly provision of private goods under uniform per-unit 
pricing, information about aggregate demand provides an adequate 
basis for decision making. Since each output unit is exclusively con- 
sumed by a single individual, the profit-maximizing price will never 
be less than marginal production cost, and hence the firm will always 
produce as many units as are demanded by all individuals at that 
price. The aggregate demand curve thus completely specifies the 
relationship between per-unit price and both aggregate consumption 
and production, directly determining the revenue-output relation- 
ship. However, even for a private good producer, the information 
revealed by the aggregate demand curve is not sufficient when dis- 
criminatory pricing schemes are being considered: for these pricing 
strategies, information is required on the “composition” of demand 
for the product. 

In contrast, for monopoly production of price-excludable public 
goods, information on aggregate demand is inadequate even under 
uniform per-unit pricing. Since each production unit can be fully and 
equally consumed by all individuals, output need never exceed that 


* Dupuit ([1844] 1952) was an acknowledged authority on price discrimination and 
influenced the adoption, by the nineteenth-century French public sector, of quite 
sophisticated pricing procedures. More recent literature includes general analyses by 
Gabor (1955-56), Puu (1964), Ng and Weisser (1974), and Yamey (1974), and contri- 
butions on quantity discounting by Buchanan (1952-53) and Berglund (1964). In many 
cases the analysis has been applied to goods with significant jointness elements. 
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required to satisfy the highest demand individual at any price. 
Moreover, the per-unit price faced by each individual can be less than 
marginal production cost since many units are jointly consumed, but 
this would necessitate the rationing of some high-demand individuals 
by output rather than by price. Consequently, not only does the 
conventional aggregate demand curve not define the relationship 
between price and output for joint goods, in general it need not even 
define the relationship between price and aggregate consumption. 
Operationally, given his knowledge of production costs, the producer 
of a joint good will be concerned to identify the maximum revenue 
obtainable from (various) given output levels, and this critically de- 
pends on the composition of demand. Specifically, he will be in- 
terested in the number of individuals who would purchase (at least) 
a certain quantity when confronted with a particular (revenue-maxi- 
mizing) price, since this determines his marginal revenue. Com- 
pared with his private goods counterpart (who can work simply with 
output and price), the joint good producer must consider output, 
revenue-maximizing price, and the number of consumers of the mar- 
ginal production unit. Accordingly, we have adopted as the funda- 
mental building block of our analysis a construct which takes account 
of the critical distributional characteristics of demand, is based on 
variables that may be empirically observable in the normal process of 
operation, and has a basis in conventional demand theory. This con- 
struct we term the “distribution of demand” or, more succinctly, the 
demand distribution. 


The Nature of the Demand Distribution 


A point on the surface of the demand distribution defines the number 
of individuals, n, who would each consume at least q units of output if 
the joint good was made available at a per-unit price of p.° Formally 
(adopting the conventional private good assumptions that we are 
dealing with large numbers of individuals and that prices and quan- 
tities are divisible) we assume that the distribution may be represented 
by a well-defined function, n = n(p,q), with the properties 


On On _ 0, (1) 


Op’ oq 
so that, by the implicit function theorem, the inverse function, p = 


p(ngq), is also well defined. 
It is appropriate that we should briefly discuss the basis for these 


> Note esp. that n does not refer to the number who would consume some part of 
output q, but rather to those who would consume it all. 
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Fic. 1.—Cross section through demand distribution with price constant at p° 
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specific assumptions. In figure 1 we show, for a particular per-unit 
price, how the number of individuals consuming each unit of output 
is functionally determined by the distribution of demand. For n here 
to be a well-defined function of g clearly we must have the demands of 
individuals arranged in order of magnitude; but, more than this, we 
require that there never be a significant number of individuals with 
identical demands (at any uniform price). Thus, not only have we 
ruled out the possibility of a significant number of identical individual 
demand curves but also (perhaps more interestingly), while accepting 
that many such curves may intersect, we have ruled out the possibility 
that a significant number will intersect at identical values of g (and p). 
The strict inequality assumed in equation (1) also involves further 
considerations. First, it implies that variables which influence the 
pattern of demand (e.g., tastes, income, and wealth) are assumed to be 
distributed continuously among the population. Were this not the 
case, significant changes in q could be associated with zero changes in 
n and a well-defined inverse function, p(n,g), may not exist. Second, 
besides these implications regarding the relationship between n and q 
(with p constant), equation (1) also implies the assumption that indi- 
vidual demand curves (defined as q; = gilp]) are downward sloping.® 


§ William Oakland has suggested a neat demonstration of how the distributional 
function, n(p,q), might be derived formally from individual demand behavior. Let the 
inverse demand relations of a subset of individuals be given by p = p(q,a), with the shift 
parameter distributed according to f(a) over (a,a). Then, for given p and q, n is 
determined by n(a) = deg (s)ds. Moreover, the intersecting case is accommodated 
immediately by considering any number of different classes of such demand relations. 


E.g., with two classes, p = p(q,a) and p = $(q,b), we would have separate density 
functions for a and b and define n = n(a) + n(b). 
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Fic. 2.—Demand distribution 


The sum of all these restrictions yields the demand distribution as 
defined above, a geometric representation of which is provided in 
figure 2. Here the boundary values P, Q, and N define, respectively, 
the maximum price that would be paid for the initial unit of output, 
the maximum quantity that would be demanded by any individual at 
zero price, and the total number of individuals. Also shown are the 
solids whose volumes indicate the revenues obtained from alternative 
per-unit pricing arrangements associated with a particular output 
level. These revenue volumes reflect the fact that, for a joint good, 
each unit produced may be consumed by all individuals willing to pay 
the price, p, and are especially relevant to the analysis undertaken in 
Section II. 


The All-or-Nothing Distribution 


The incorporation into our analysis of the (desirable) feature that 
individual demand curves may intersect has important consequences 
for the treatment of all-or-nothing procedures. With respect to the 
perspective provided in figure 1, the presence of such intersections 
means that the order in which individuals’ demand curves are ar- 
ranged must vary according to the (constant) price level considered if 
ordering with respect to magnitude is to be preserved. Thus, in 
general it may not be assumed that a cross section through the distri- 
bution with n held constant will indicate the demand relation of a 
particular individual (although this will be so in the nonintersecting 
case). Even if we make the conventional assumption of zero income 
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effects so that p(n,q) indicates the marginal evaluation of different 
consumption units, the integral of this inverse function, given n, 
cannot be assumed to measure the “willingness-to-pay” of a particular 
individual. For this reason, and because users of all-or-nothing pro- 
cedures will generate market observations on variables different from 
those relevant to the per-unit pricing case, the analysis of these strate- 
gies (presented in Section II) has been undertaken with respect to an 
alternative (but related) distributional construct, the all-or-nothing dis- 
tribution. A point on the surface of this distribution defines the num- 
ber of individuals, n, who would purchase a bundle of ¢ units of the 
good at an average (per-unit) charge C rather than go without the 
good altogether. We assume, analogously to the demand distribution, 
that the all-or-nothing distribution may be represented by a well- 
defined function, n = n(C,q), with the properties 








On On 
< 
aC’ aq 0, (2) 


again ensuring that the inverse function, C = C(n,q), is well defined. 


Some Useful Definitions 


Finally, later discussion is considerably assisted by reference to par- 
ticular concepts which may be formally defined at this stage. The 
definitions made here with respect to the demand distribution may 
also be applied to the all-or-nothing distribution. 

DEFINITION |: A level of output qg will be termed efficient with respect 
to (a particular pricing procedure) if under that procedure: R(q) > R(q*) 
V q* <q (where R denotes total revenue). 

DEFINITION 2: A regular distribution of demand has the property: 
O?R (qo)/Op? < 0 (where R[qo] = plnr,qol - nlp.qol). 

DEFINITION 3: A weakly restricted distribution of demand has the 
property that: 0p*(qo)/Oqo < 0 -(where p*[qo] is that price which 
maximizes R[qo]). 

Definitions 2 and 3 are restrictions on the nature of the demand 
distribution, the former requiring concavity of the revenue function 
relating to consumption solely in respect of the goth unit of output.” 
Some of the more complex operational results derived in the paper 
require these plausible restrictions to hold, and in these cases the 


’ Definition 2 reflects the conventional simplification adopted in private good analysis 
that marginal revenue functions are strictly downward sloping. The restriction implied 
by definition 3 is lent plausibility by the observations that it will always be true if cross 
sections through the distribution for g constant are (mathematically) similar and that 
even where this is not so the overall trend in p*(q) must be negative. 
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results have been presented as propositions with proofs clearly 
identified in the text. 


II. Uniform Per-Unit Pricing 


We have previously emphasized the important distinction between 
output and aggregate consumption in the joint goods context. Since, 
operationally, the entrepreneur will be concerned with the profitabil- 
ity of various levels of output and because his production cost condi- 
tions need have no unusual features, our analysis of alternative mar- 
keting strategies can focus on his search for the revenue-maximizing 
price (and/or charge) associated with each potential output level and, 
more generally, on the characteristics of the revenue functions he 
faces. Initially, we suppose that the joint good producer seeks to 
maximize profit while setting a price which is uniform across both 
individuals and output units. 


Basic Characteristics of the Uniform Per-Unit Price 


The unusual nature of his pricing decision is made transparent by 
first supposing that the entrepreneur behaves analogously to his pri- 
vate good counterpart, allowing all individuals to consume as much as 
they wish at the prevailing per unit price. Since, for joint goods, 
output would be then determined by the highest individual con- 
sumption level demanded at each price, this strategy can be alter- 
natively represented by the assumption that at any output level the 
producer selects the maximum price at which anyone is willing to 
consume the entire output. This procedure we call Maximum Uniform 
Pricnmg (MUP): In figure 2, for output q it involves setting price 
p;, at which price the highest demand individual consumes the en- 
tire output, and a total of m, individuals consume at least part of it. 

The revenue yielded by this pricing procedure, for any output level 
Jo, 1S given by: 


R (qo)/MUP ae - n(po.q)dq, (3) 


where fy = p(0,qo) and is constant throughout the integration. Differ- 
entiation of (3) with respect to go yields the marginal revenue (MR) 
function:® 


0 0 ct) 
4 5 0 
is pol ap ribo) a 4 + Do. N (Poo): 


8 Discussion of the appropriate algebraic methods for derivation of this expression is 
to be found in Courant (1960), p. 220. 
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Note, however, that the last term in this expression, representing the 
effect of a change in output, price held constant, in the limit ap- 
proaches zero because n(o,4o) itself tends to zero. Thus, MR(qo)/MUP 
can be written as: 


0 
MR(qo)/MUP = [ER (qy)/MUP]-5r (4a) 
This marginal revenue function has a number of interesting 
properties (identified in the illustrated example in fig. 4 below), but 
more importantly it immediately suggests that for joint goods the 
optimal (i.e., revenue-maximizing) pricing strategy differs from MUP 
(the private goods analogue). Any level of output which is efficient with 
respect to MUP must have MR(q)/MUP > 0, and it follows directly from 
(4) and (4a) that, at any such output, revenue would be increased by 
lowering price (with output held constant). This is clearly illustrated 
in figure 2, where for output q, lowering price from fp, (i.e., p[q]/MUP) 
to py increases total revenue: revenue declines from the original 
consumption units, but this is more than offset by the fact that each 
output unit is now more intensively consumed. Thus, the Optimal 
Uniform Price, p(q)/OUP, will be less than the Maximum Uniform Price, 
p(Q/MUP, at any output level where marginal cost is positive. Moreover, 
some individuals (all those for whom qi[p] > q) will be rationed by output 
while others (for whom qilp] S q) are rationed by price, and this is indeed a 
general characteristic of optimal pricing for joint goods which does 
not apply for private goods.® 


Some General Properties of Revenue Functions under OUP 


Some insights into more general properties of the relevant revenue 
functions are straightforward. If OUP is charged for an output level 
qo, We must have 


ALR (qo/OUP] _ 
ae en 


It follows trivially, either from arguments similar to (3) and (4) above 
or from geometric considerations, that: 


MR(qo)/OUP = [p(qo)/OUP] + n[p(qo)/OUP, gol. (6) 


(5) 


* The possible need to ration by output for per-unit pricing was first recognized in 
Brennan and Walsh (1978). Strictly speaking, rationing applies only if at the profit- 
maximizing output MC > 0 (see subsection, “Some General Properties of Revenue 
Functions under OUP,” following). Moreover, while for p > MC > 0 rationing cannot 
sensibly be considered to apply, the range of output levels for which this occurs is 
negligible when large numbers of consumers are involved. It should be observed, also, 
that the use of continuous mathematical methods makes it appropriate to regard 
n[p(qo)/OUP,qo] > 0 as indicative of rationing even for the special case of 1 > 
nlp (qo)/OUP, qo] 0. 
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That is, under OUP marginal revenue (with respect to output) will be equal to 
the revenue obtained solely in respect of the final (i.e., least consumed) unit 
of output. Regarding the characteristics of the function MR(q)/OUP, 
it clearly can never exceed max [p(n,q) - n(p,q)] which, given the 
properties of the demand distribution (1) above, must decrease as 
output increases. Since MR(0)/OUP = max [p(n,0) - n(p,0)] marginal 
revenue under OUP must obtain its maximum value at zero output, 
although it may sometimes increase with output at some nonzero 
production levels.1° 

The results (5) and (6) also yield insight into the characteristics of 
OUP when output is selected to yield the maximum possible revenue. 
Since this situation must involve p(q)/OUP > 0 but MR(qg)/OUP = 0 it 
follows from (6) that there will be no rationing. In this case the OUP 
solution has MUP characteristics and, since (5) always holds for OUP, 
is in fact identical to the revenue-maximizing solution for MUP. 


Output and Price Determination under OUP 


Since profit-maximization requires the equality of marginal cost and 
marginal revenue, we have begun by deriving and identifying the 
properties of the marginal revenue function appropriate to OUP, 
properties which (with respect to output) turn out to be similar to 
those observed in the private good case. The actual process of output 
determination under OUP is not unduly complicated. Once the Op- 
timal Uniform Price has been (experimentally) determined for a 
particular output level it follows directly from (6), providing con- 
sumption of the final unit of output is known or can be estimated, that 
marginal revenue is also determined. However, the decision to in- 
crease or decrease output must be accompanied by a decision re- 
garding price adjustment, and here the user of OUP is confronted, in 
complete contrast to the conventional (private goods) wisdom, by the 
possibility that the profit-maximizing price may rise as the profit- 
maximizing output increases, ceteris paribus. In order to show this we 
first derive a general result: 

Proposition 1: The Optimal Uniform Price for a given output 
level, p(q)/OUP, may generally be expected to lie in the range 
bounded by the previous such price, p(q — dq)/OUP, and the price 
which maximizes revenue solely in respect of the qth unit of output, 
p*(q). This will always be so if the demand distribution is regular. 

ProorF: If a distribution is regular, for a given g and dq, the revenue 


‘© The operationality of expression (6) does, of course, require that it is possible to 
identify how many individuals consume the entire output. Lack of this information in 
no way invalidates the results derived for total and marginal revenue functions 
throughout the paper, but it must be recognized that practical determination of MR 
often may be less straightforward than is implied by the simplicity of the stated results. 
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expression p(n,q) « n(p,q)dq is a concave function of price, as there- 
fore must be the sum of any such expressions. The revenue obtained 
from per-unit pricing q units of output may be expressed as 
[ie-4p (n,q) * n(p,q)dq + p(n.q) - n(p.q)dq, which is therefore the sum 
of two concave functions of price. As such its maximum must be 
attained at a price bounded by those prices which maximize each of its 
parts, p(q — dq)/OUP and p*(q). Even if the functions are not globally 
concave with respect to price, since they are continuous they must be 
locally concave at their maxima. Where the maxima are not at 
significantly different prices, as well as in those cases where con- 
vexities in the revenue functions are not severe, we might also expect 
p(q)/OUP to be bounded as suggested. [_] 

This conclusion, which is useful in the derivation of later results, 
also has some operational content. It suggests that in the absence of 
information about the trend of p*(q), larger output expansions could 
initially be accompanied by a price decrease, and for small increases in 
output (since p*[q + dq] — p*[q]) perhaps price initially should be set 
between p(q)/OUP and p*(q). More immediately, however, it leads toa 
fundamental observation: 

PRroposiTION 2: The Optimal Uniform Price, p(q)/OUP, may some- 
times increase as output increases. However, this will never occur if 
the demand distribution is regular and weakly restricted. 

Proor: The first part follows directly from the possibility (illus- 
trated in fig. 3, where p*[1] applies strictly to the initial unit of output 
and hence is equal to p[1]/OUP) that p(q — dq)/OUP may be less than 





Iso-revenue locii (in respect of 
individual units of output) 


0 N 


Fic. 3.—Distribution characteristics yielding price increases as output increases 
under OUP. 
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p*(q). For the second part, given a weakly restricted distribution, so 
that p*(q — dq) > p*(q), and regularity, it follows from proposition | 
that 


p(0)/OUP = p*(0) > p*(dq) 
op (O)/OUP > p(dq)/OUP > p*(dq) > p*(2dq) 
“.p(q — dqgVOUP > p(qg)/OUP >p*q). O 


The phenomenon of (optimal) price rising when output is increased 
(i.e., for reductions in marginal cost) appears to be unique to the joint 
goods context. Moreover, even in the case of regular and weakly 
restricted distributions, the more conventional fall in price (as output 
rises) may be very small compared with that expected for a private 
good."* Other unusual phenomena, though perhaps to be given no 
more attention than peculiarities which can arise in traditional private 
goods cases, may also occur. For example, the optimal number of 
individuals to ration may (perversely) increase as output increases. 


An Overview of Results for Uniform Per-Unit Pricing 


Except in the special case where marginal cost is zero, Optimal Uni- 
form Pricing of joint goods for any output involves a price lower than 
the Maximum Uniform Price for that output, and, consequently, 
some individuals will be rationed by output. Marginal revenue for 
OUP can be defined straightforwardly as the revenue obtained solely 
in respect of the last unit of output; the characteristics of the MR 
function are similar to those applying to private goods; and _ pre- 
scriptions regarding price and output determination can be obtained 
with only slightly more difficulty than in the private goods case. 
However, the Optimal Uniform Price may increase as output in- 
creases, and, even if this result does not arise, the price reduction 
associated with output increases may be small relative to those which 
would occur if the good was private. 


IiI. More Complex Pricing Strategies 


Observation of common business practice suggests that in the course 
of their marketing operations firms can generate information which 


1! Brennan and Walsh (1978) isolate a result in which price increases with output, but 
one which applies to specific points (actually “gaps”) in the price-output relationship 
and which appears to emerge from the small-numbers case they deal with. Our analysis, 
however, using large numbers, confirms that the whole trend of prices may be upward 
over some output ranges. Our other result—that price falls may be small—is illustrated 
by the linear distribution example in Section IV since P/2 > p(q)/OUP > P/3, where P is 
the price intercept of the highest demand individual. 
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permits them to adopt pricing strategies more sophisticated than 
uniform per-unit pricing. Given the “service” nature of the typical 
examples, monopoly producers of joint goods may be well placed to 
consider the use of all-or-nothing charges, entrance or license fees, 
and procedures which price units (or bundles of units) separately, 
though on operational grounds we assume that such procedures are 
applied uniformly across consumers. As with per-unit pricing, our 
analysis of these possibilities is concerned to identify basic charac- 
teristics of the optimal pricing arrangement and operational aspects 
of price and output determination. We are now also concerned, 
however, to consider the relative profitability of the alternatives open 
to the firm, and for this purpose the demand and all-or-nothing 
distributions become all the more important. However, to enable 
clear-cut results to be generated we are obliged to assume that income 
effects are negligible. Since substitution effects can generally be ex- 
pected to dominate income effects, it is likely that our results are 
broadly applicable, but the possibility of conclusions being reversed by 
income effects in some instances cannot be ruled out. 


All-or-Nothing Pricing 


All-or-nothing type pricing schemes are exemplified by “season ticket 
only” offers for a series of operas or Shakespeare plays and flat-rate 
bus or subway fares. We assume that the same all-or-nothing offer 
confronts all individuals for a particular output bundle offered by the 
producer, though it is convenient to express this offer in terms of an 
average (per-unit equivalent) charge which must be paid for all units 
in the bundle. The Optimal (revenue-maximizing) All-or-Nothing 
Charge (OAN) will be such that the marginal consumer pays his average 
valuation for the units in the bundle and the OAN strategy inherently involves 
rationing by output. 

It is interesting to note that, in the special case where individual 
demand curves do not intersect, there is an obvious symmetry be- 
tween the all-or-nothing and per-unit pricing procedures; as is dis- 
cussed more fully below, in that case the revenue volumes (and re- 
lated algebraic expressions) may be similarly defined, except that the 
roles of p and n are reversed. However, to avoid these restrictions on 
the demand distribution we initially focus attention on the all-or- 
nothing distribution defined in Section I. Here, for a given output, go, 
revenue is defined by: 


R (qo) a qo C(n,qo) “Nn, (7) 


where C is the (per-unit) all-or-nothing charge for all units in that 
output bundle. The optimal value of n and associated charge C may 
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be obtained directly from the first-order condition: 


OR (qo) /OAN OC 
= gen + Clnqo)| = 0. (8) 


This must be true for all values of g and defines n as a function of q for 
optimal values of C. It is straightforward to proceed from here and 
obtain: 


OR(q) S 
oe dn*, 





MR(q)/OAN = n - C(n,q) + (9) 
where dn* is the change in n when q is increased by one unit with C 
held constant (at its previous optimal level). 

Operationally, marginal revenue thus defined may be obtained in 
just two steps: First, locate the optimal per-unit charge for a given 
output level (which gives us n - C[n,q]), and second, note the change 
in the number of consumers when output is increased by one unit and 
the charge (per-unit) is held constant (which yields the second part of 
eq. [9]). Evidently, MR(q)/OAN 1s not much more difficult to determine than 
MR(q)/OUP. 

Of more immediate interest, however, is the revenue yield of all- 
or-nothing charges relative to OUP. Insight may be obtained by 
limiting attention to the case where individual demand curves do not 
intersect. In these circumstances, R(q9)/OUP involves choosing p to 
maximize J “*p - n(p,q)dq while R (q))/OAN involves choosing n to maxi- 
mize Joon - p(n,q)dq (in the nonintersecting case p[n,q] being, for 
given n, the nth individual’s inverse demand function). The natural 
symmetry between the two procedures leads directly to two observa- 
tions. First, for any distribution which is symmetrical in p and n (..e., 
so that if p =a,n = 5 satisfies n = n[p,qo], so must p = b andn = a) for 
some scale of measurement of these variables, the revenues must be 
identical; second, the existence of some distribution for which R (q)/ 
OAN > R(q)/OUP implies the existence of a distribution yielding a 
contrary result. Thus, OAN may be superior or inferior to OUP in revenue 
terms depending upon the characteristics of the demand distribution. More- 
over, the general properties of the two strategies may be quite similar. 
For example, by arguments directly analogous to those used in propo- 
sitions 1 and 2 it follows that the optimal value of C may increase as output 
increases, but not if the all-or-nothing distribution is both regular and weakly 
restricted. It is also possible that the number consuming a given bundle, 
n(q)/OAN, may sometimes increase as output increases.” 


” See again Courant (1960). As an approximation, the linear form MR(q)/OAN = 
MG/2 may be useful in estimating whether the unit output expansion should be tried. 

‘8 This can be seen easily for a distribution with nonintersecting demands (but no 
other restrictions), since then OAN analysis can be conducted with the basic demand 
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Given that selection of the Optimal All-or-Nothing Charge may be 
only slightly more difficult than selection of the Optimal Uniform 
Price, there is virtually no theoretical basis for choosing between the 
two procedures. This should cause little surprise because the degree 
of complexity underlying the maximization problem faced by the 
entrepreneur is identical in both cases. Alternatively, insight into the 
ambiguity in revenue ranking may be obtained from the realization 
that C(q)OAN may be S p(q)/OUP depending upon the distribution and 
(compared with a uniform per-unit price equal to C) that OAN involves 
the additional consumption of (lower-valued) units by some individuals while 
excluding all (higher-valued) consumption by others. a4 


Two-Part Pricing 


The characteristic feature of Two-Part Pricing (TPP) is the combined 
use of an entrance (or license) fee and a uniform per-unit price—a 
combination which is commonly used in the marketing of recreational 
and entertainment services. The essence of the TPP case can be 
captured in the following: 

Proposition 3: The introduction of a license fee in conjunction 
with uniform per-unit pricing may reduce revenue. However, if the 
demand distribution is both regular and weakly restricted and p(q)/ 
OUP is used as the per-unit price, revenue is necessarily increased. 

ProorF: With uniform per-unit pricing, more individuals consume 
the “initial” unit of output than any other unit. The marginal intro- 
duction of a license fee may be viewed as raising the price of the initial 
output unit without affecting the price or consumption of later units. 
The revenue effect will, therefore, be 


O[p(n,0) - n(p,0)] 
Op . 
and even for a regular distribution this may be positive or negative 
depending upon whether the original per-unit price was below or 
above p*(0) (the price which maximizes revenue from the first unit 
alone). It follows directly from proposition 2, however, that if OUP is 


distribution but with p and n reversed: Whatever holds for OUP, an analogous result 
holds for OAN. This similarity between OAN and OUP, moreover, suggests an 
analogue to MUP which might be termed “simple all-or-nothing pricing” (SAN). For 
output q, it involves setting a charge so that all who have positive marginal evaluations at 
that output are permitted to consume, the marginal consumer paying his total evalua- 
tion. R(q)/SAN = R(q)/MUP for nonintersecting demands with the distribution sym- 
metrical in p and n. 

4 Indeed, for the linear distribution used in Section IV we have both that C(q)/OAN 
= p(q)/OUP and that the additional and excluded consumptions are identical. There- 
fore, R(q)/OAN = R(q)/OUP. 
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used and the distribution is both regular and weakly restricted, the 
revenue effect must be positive. 1 

That this proposition has two distinct components is quite inten- 
tional, since together they shed light on a trade-off which gives rise to 
the existence of an interesting possibility. From the first part of the 
proposition, if a given output level has a uniform price greater than 
p*(0), the introduction of an entrance fee will cause a greater loss of 
revenue from now-excluded consumers than is gained from entrance 
payments by the remaining consumers. From the second part of the 
proposition, the two effects yield a net revenue gain for a given 
output level for any uniform price lower than p*(0). Insight into the 
distinction between the two cases may be aided by regarding the first 
as involving a relatively high per-unit price and small number of 
consumers and the second as involving the reverse.'’* However, in 
lowering the per-unit price to make an entrance fee profitable (in an 
absolute sense) the departure from the uniform per-unit price p(q)/ 
OUP may be sufficiently large to involve a greater loss of revenue 
than can ever be captured by the fee. Thus, for some distributions it may 
never be profitable (relative to OUP) to use an entrance fee in conjunction with 
uniform per-unit pricing (1.e., TPP). 

Notwithstanding this interesting possibility, proposition 3 does sug- 
gest that under plausible conditions some form of two-part pricing 
will be superior to OUP, and our conceptual experiment of intro- 
ducing a marginal entrance fee suggests there is no great difficulty in 
gaining some extra revenue. Indeed, if for any given output level the 
revenue-maximizing license fee and per-unit price could be deter- 
mined, there would be little difficulty in locating the profit-maxi- 
mizing output level, since we must have: 


MR(q)/TPP = p - n(p.9), (10) 


where p is the relevant per-unit price for output level g. This unsur- 
prising result follows from arguments similar to those used to derive 
MR(q)/OUP above and, indeed, it is an interesting property of many 
marketing strategies that the marginal revenue function is defined by 
the revenue obtained solely in respect of the final unit of output. The 
reason is straightforward. The optimal setting of the parameters for 
any such strategy implies that marginal changes in those parameters 
will not change the revenue associated with any given output level. If 
a marginal increase in output results in only an incremental change in 
the price parameter applying to final production units, then revenue 
with respect to the original output level will be unchanged, and any 


' Strictly speaking, allowance should also be made for the way n changes as p changes 
in the two situations. 
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change in overall revenue must be simply that amount obtained solely from the 
final output unit."® 

Other results (related to those for OUP) follow almost immediately 
from (10). Except where marginal costs are zero, TPP will involve 
rationing of some individuals by output; and the per-unit price component may 
increase with output but it will not do so if the demand distribution is 
both regular and weakly restricted. In these latter circumstances it can 
also be established that the per-unit price element of TPP is less than 
p(q)/OUP.” However, even allowing for this last insight and despite 
the simplicity of the marginal revenue function, the entrepreneur 
would still need to undertake innumerable experimental variations in 
both the license fee and per-unit price components at each output 
level. It remains unclear, therefore, whether TPP dominates OUP (and, a 
fortiori, OAN) in terms of overall ( operational) profitability even when rt does 
so in terms of potential revenue-raising capacity. 


Multipart Pricing 


We now suppose that the producer considers confronting consumers 
with a schedule of prices in which different units (or bundles of units) 
attract a different price (though the same price schedule is applied to 
all consumers). This strategy might be exemplified (though perhaps 
somewhat imperfectly) by “quantity discount” offers for joint goods, 
such as are applied to a series of orchestral concerts, operas, or plays. 

Obviously, the Optimal Multipart Pricing procedure (MPP) will 
involve pricing each unit separately at the price, p*(q), which maxi- 
mizes revenue obtained solely in respect of that unit (although, as 
we observe later, for operational reasons the producer may offer 
different sized bundles of units with a composite charge, or average 
per-unit equivalent, equal to the sum of the revenue-maximizing 
separate prices for the units it contains). The price-output relation- 
ship is, therefore, immediately defined and, as with other strategies, 
price may increase with output over some ranges. However, in all cases the 
overall trend must involve prices decreasing with output, and for a 
weakly restricted distribution (this restriction directly relating to p*[q]) 
the price-output schedule would necessarily be negatively sloped, with 
higher prices charged for the most intensively used (“lower level’) 
units of output. 


'6 The reader may wish to confirm why such arguments do not apply to the MR(q)/ 
OAN function defined with respect to parameters of the all-or-nothing distribution. 

™ Given 0C/0q, 0C/dn < 0, the per-unit price component of TPP will be that which 
maximizes revenue in respect of the last (q¢ — q*) units, g* being the minimum con- 
sumption of any (nonexcluded) individual. From propositions 1 and 2 this price must 
be less than p(q)/OUP given the stated restrictions. 
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For the MPP strategy, marginal revenue is trivially defined as the 
maximum revenue obtainable solely in respect of the gth unit, {p*(q) 
‘ n[p*(q), q]}, and, given the properties of the demand distribution in 
(1) above, MR(q)/MPP must decrease as output increases. Equally 
clearly, R(q)/MPP must exceed R(q)/OUP, except in the special case where 
p*(q@ = p*(O) for all q, in which case the strategies are identical in all respects. 

This superiority of MPP might have been expected to extend 
straightforwardly to comparisons with other relatively simple strate- 
gies, such as OAN. Interestingly enough, no such exact result can be 
obtained unless the restriction is made that individual demand curves 
do not intersect. In these special circumstances, the marginal con- 
sumer of a bundle of units under OAN necessarily has the lowest 
marginal valuation of each and every unit in the bundle. The revenue 
obtained under OAN would, therefore, be equal to that obtained 
from a form of multipart pricing which makes the units available at 
the separate prices, p»(q), given by this marginal individual’s demand 
curve. However, except where (quite fortuitously) those prices are also the 
optimal p*(q) this OAN revenue must be less than that obtained under MPP. 
In more general circumstances where demand curves do intersect, it 
can no longer be presumed that all individuals who would consume 
the entire bundle at the OAN composite charge would still do so if the 
units were priced separately at p,,(q). Even though it is also true that 
some individuals who would not purchase the bundle at the composite 
charge would undertake some consumption at these separate prices, it 
remains possible that R(q)/OAN may exceed the revenue obtained 
from the multipart pricing arrangement using p,,(q). Since we do not 
know by how much R(q)/MPP exceeds this latter arrangement, we 
cannot be certain that it in fact exceeds R(q)/OAN. And this uncer- 
tainty extends to comparisons between MPP and any strategy involv- 
ing a lump-sum component, including TPP. It appears that the general 
expectation that MPP will be the revenue-dominant strategy, while intuitively 
plausible, can only be established in a probabilistic sense. Nonetheless (and 
notwithstanding some operational difficulties referred to below) MPP 
might be regarded as closest to a theoretical “best” procedure in 
revenue terms when interpersonal discrimination is ruled out. 

Two further aspects of the MPP strategy might be noted. First, the 
notion of (a schedule of) prices related to “intensity of use” immedi- 
ately suggests a comparison between the MPP solution and Oakland’s 
(1974) competitive solution. Although for Oakland’s case the price 
schedule is unambiguously positively related to output while our 
monopoly counterpart may characteristically involve a negative re- 
lationship between price and output, the similarities are more than 
merely superficial. For a given demand distribution and marginal cost 
schedule, the Oakland competitive output and the monopoly output 
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under MPP must be identical, since both solutions must be character- 
ized by marginal cost being equated to the “total price” (n - p) which 
maximizes revenue with respect to the last unit of output and, in both 
cases, this will in general be associated with rationing by output.'® (Of 
course for Oakland no above-normal profit exists on any unit, and 
consumption by inframarginal consumers is greater for his solution 
than under MPP because inframarginal units bear a lower price in the 
competitive case.) 

Second, however, given that the characteristic MPP solution in- 
volves a negatively sloped price schedule, the operational status of 
MPP differs markedly from that of Oakland’s solution. For MPP we 
must assume that producers can acquire information on the relevant 
revenue-maximizing prices, can prevent resale (or exchange) of units, 
and can prevent consumers from “reversing intensity” (i.e., purchas- 
ing lower-priced units without first purchasing the higher-priced in- 
framarginal units). Resale possibilities may be of little concern in the 
joint good context given the service nature of most of the examples. 
However, preventing reversals of intensity requires either that the 
producer can completely meter individual purchases or that he can 
present consumers with a choice between bundles of units (containing 
1, 2, 3, and so on, up tog units) each with a composite charge equal to 
the sum of the revenue-maximizing prices for the separate units it 
contains (a strategy we might call MPP*). However, for MPP* to be 
made operational, producers must be able to determine the optimal 
set of composite charges (and hence the relevant p*[q]), and this may 
be a great deal more complex than isolating the p*(q) when intensity 
reversals do not occur. In general, firms would initially have to reduce 
output to zero and proceed sequentially—first determining the rev- 
enue-maximizing charge, p*(1), for one unit of output, then pro- 
ducing a second unit and determining the optimal composite charge 
for a two-unit bundle (i.e., CC*[2] = p*[1] + p*[2]), and so on.” If 
resale possibilities or problems of overcoming intensity reversals 


'® Oakland’s output is the maximal quantity, g, for which k - p,(q) = MC (ranking 
individuals k = 1,...,K so that p,[q] is the highest inverse demand). Ifk - p,(q) > MC 
(for some k > 1) when q is such that p,(q¢) = MC, output is expanded and so on. This is 
equivalent to increasing output whenever the maximum revenue from the marginal 
production unit exceeds MC and, unless g is maximal where p,(q¢) = MC, at least the 
highest demand individual is rationed by output. A referee suggested that the equality 
of output for MPP and competition is analogous to the private good result for perfect 
discrimination and perfect competition. This may be potentially misleading: MPP is not 
perfect discrimination, and in general the MPP (and hence competitive) output is not 
efficient. 

* Though this may be a protracted experiment and fairly costly in forgone-revenue 
terms (particularly if fixed costs are high), if the revenue-maximizing charge declines, 
firms may be able to minimize their losses by allowing consumption of further units at 
the per-unit price p*(1) while experimenting to find the relevant charges for bundles of 
two or more units. Similar but more complex procedures are required if the optimal 
charge may increase over some output ranges. 
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create insurmountable problems, then the producer will be obliged to 
set prices such that p(q) < p(q + dq) for allg, and the most that can be 
said is that for a weakly restricted distribution, the optimal separate 
prices are in fact uniform and therefore coincident with OUP. 


Summary of Results 


The more complex, but uniformly applied, pricing strategies con- 
sidered here retain two basic characteristics of optimal uniform per- 
unit pricing: All generally involve rationing by output, and all may 
generate a positive price-output relationship over some output 
ranges. For all the strategies, moreover, the relevant marginal reve- 
nue functions can be defined with little or no more difficulty than 
under OUP. Optimal All-or-Nothing pricing (OAN) is operationally 
little more complex than OUP, but there are no strong theoretical 
grounds for choosing between OAN and OUP in revenue terms. 
Selection of the Optimal Two-Part Price (TPP) is more difficult than 
selection of the OUP (and OAN), and its revenue dominance is 
assured only for restricted (though plausible) forms of the demand 
distribution. Optimal Multipart Pricing (MPP) is revenue dominant 
over OUP, but its dominance over OAN and TPP cannot be univer- 
sally guaranteed, and its position as the “theoretical best” strategy can 
only be established in probabilistic terms. The MPP output level is, for 
given cost and demand conditions, identical to Oakland’s competitive 
output, but for operational reasons MPP may be of somewhat limited 
commercial applicability. 


IV. Illustrative Analysis with a Specific Distributional Form 


Both to illustrate the methodology used in the general analysis and to 
consider some of the welfare properties of monopoly provision, we 
now analyze the results that emerge when a specific demand distri- 
bution is used. In choosing a distributional form, account has been 
taken of three factors. First, there would be heuristic gains from 
adopting a framework which could perform an expositional role 
similar to that of linear demand curves in traditional demand analysis. 
Second, the chosen distributional form should exhibit the more fun- 
damental characteristics of distributions likely to be met in practice. 
Third, the framework should not be devoid of operational content. 
Consideration of these factors leads more or less unequivocally to the 
choice of a distribution similar to that shown in figure 2 but for which 
the surface is linear.2? To enable the framework to include all-or- 


20 On this, and for a further application of results obtainable from a linear demand 
distribution, see Burns (1979). 
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nothing charges we also assume that demand curves do not intersect 
and that income effects may be regarded as negligible. 

Within this framework, the bounding values P,Q, and N completely 
define the distribution and may be treated as (empirically determina- 
ble) constants throughout the analysis which follows; consideration 
of alternative pricing procedures can be simplified considerably by 
drawing on the proportionality properties of similar triangles. In this 
respect it is convenient to define 


papain pandias (11) 


where p, q, and n represent specific values of price, output, and 
numbers of individuals. The actual derivation of the required reve- 
nue functions and pricing rules for a range of pricing strategies has 
been presented in some detail in Burns (1979), and the summary 
information contained here in table 1 and figure 4 is sufficient for 
present purposes. 

In table 1 seven alternative strategies are defined, the associated 
revenue functions and pricing rules being shown for each of these 
strategies. The revenue functions have been presented in a form that 
makes self-evident the question of relative profitability (for given 
output levels) and illustrates certain results obtained earlier, such as 
the general dominance of OUP and OAN over MUP and SAN. These 
functions also provide a neat illustration of two further points. First, 
and unsurprisingly, revenue (for given output) is positively correlated 
with the degree of complexity of the strategy adopted. Second, 
broadly speaking, schemes of comparable complexity should be ex- 
pected to yield similar revenues: for example, we can see that there is 
nothing to choose between OUP and OAN, although nonlinear dis- 
tributions could be constructed (but probably not empirically deter- 
mined) to favor either of these procedures. It is also apparent from 
the revenue functions that the returns to increased complexity of 
strategy diminish very rapidly, particularly at lower output levels. 

Figure 4 is fundamental in illustrating the basic methodology sug- 
gested in this paper. Entrepreneurial decision making concerning 
output and price determination relies heavily on information about 
the marginal revenue function of a chosen strategy and, of course, on 
the relationship of such a function to (marginal) cost behavior: figure 
4 contains the derived marginal revenue functions for each strategy. 
Also shown is a schedule indicating the sum of marginal valuations at 
different output levels: the intersection of this schedule with a margi- 
nal cost function would determine the optimally efficient output. 
Assuming, initially, constant marginal costs, the results for a linear 
distribution offer a clear warning that, over a substantial range of 
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Marginal Revenue 
$/PN 
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Output 


Fic. 4.—Marginal revenue functions for a linear distribution 


(higher) marginal cost values, there may be negligible differences in 
the outputs achieved by different procedures of varying complexity 
and that the outputs attained by even the most complex procedures 
may be considerably less than the optimally efficient output. On the 
other hand, at lower levels of marginal cost the difference in output 
levels becomes more pronounced, indicative of the fact that more 
complex procedures achieve their maximum possible revenues at 
higher output levels, but in general all procedures more nearly ap- 
proach the optimally efficient output level, especially the more 
sophisticated ones. Finally, noting that the monotonicity of MR(q)/ 
MPP is guaranteed while that of other marginal revenue functions is 
plausible,”’ a situation of decreasing marginal costs will increase the 


*1 E.g., although MR(q)/OUP may sometimes increase with output, its general trend 
must be downward (see Section II, “Some General Properties of Revenue Functions 
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divergence in output levels, while increasing marginal costs would 
have the converse effect. 

In conclusion, we offer a cautionary remark about comparisons 
between privately achieved and optimally efficient output levels. Our 
results indicate (not surprisingly) that suboptimal output levels can be 
expected for all of the private procedures considered. While these 
results are relevant to the concern shown by a number of authors who 
have made such comparisons in other contexts (and have sometimes 
claimed that efficient outputs might be secured privately), it is clear 
that comparisons of output levels alone are of little interest. In the 
case of the linear distribution, outputs under OUP and OAN would 
be identical, but the efficiency implications of the two procedures are 
significantly different. Indeed, one of the insights generated by the 
approach advanced in this paper is that the replacement of a uniform 
per-unit pricing strategy with a (comparably complex) all-or-nothing 
arrangement will succeed only in substituting the consumption of 
lower-valued units of output by higher-demand individuals for the 
consumption of higher-valued units of output by lower-demand indi- 
viduals. In general, efficiency comparisons must be based on the 
degree of consumption rationing (whether by price or by output) that 
occurs for all consumers vis-a-vis the completely efficient outcome in 
which all individuals would consume all units produced and output 
would be such that the sum of marginal valuations equals marginal 
cost. 


V. Concluding Remarks 


A major contribution of this paper is its demonstration that the 
complete demand distribution provides a novel and powerful basis 
for analyzing private provision of price-excludable public (or joint) 
goods. For a monopoly producer of such goods we have identified, 
under surprisingly general conditions, the characteristics of the reve- 
nue functions associated with various pricing strategies, indicated the 
likely relative profitability of those strategies, and prescribed proce- 
dures which might aid firms in isolating profit-maximizing price and 
output for any chosen strategy. Moreover, the specific example we 
provide, based on a linear demand distribution, not only serves to 
illustrate our general results, but also in practice may constitute an 
important empirical basis for generating first approximations for 
output and price levels. 


under OUP”). Moreover, given MPP as the theoretical best strategy, MR(q)/MPP would 
be the boundary form toward which other MR functions tend (as complexity of the 
pricing strategy is increased), and MR(q)/MPP is unambiguously negatively sloped. 
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Using this linear demand distribution, we have also sketched out 
some efficiency implications of our results, and, in this regard, an- 
other important application of our general approach is highlighted. 
For joint goods, overall efficiency depends on how output is “ra- 
tioned” among consumers as much as on the level of output per se. 
A framework containing information on the distribution of demands 
clearly is indispensable to a comprehensive analysis of the efficiency of 
private (or, indeed, public) provision of joint goods. In fact, our 
framework could be used profitably as a basis for a general analysis of 
joint goods provision under competitive conditions and other market 
or nonmarket structures. Moreover, the demand distribution ap- 
proach is equally essential to analysis of “discriminatory” pricing 
strategies for private goods, the necessary extension of the analysis 
presented here being particularly straightforward when constant 
marginal costs prevail. 
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Comments 





Recalculating the Scientific Tariff 


Bruce R. Bolnick 


Harvard Institute for International Development 


In part 2 of his classic 1960 paper, “The Cost of Protection and the 
Scientific Tariff,” Harry G. Johnson analyzed the second-best’ prob- 
lem of using tariffs to promote “noneconomic” objectives commonly 
suggested as arguments for protection. Very simply, Johnson showed 
how to compute the balance between marginal benefits and marginal 
costs using Harberger-type welfare triangles. He illustrated the meth- 
odology for five selected noneconomic objectives: (a) a tariff to pro- 
mote self-sufficiency and independence; () a tariff to promote di- 
versification, industrialization, or agriculturalization; (c) a tariff to 
promote a “way of life”; (d) a tariff to increase military preparedness; 
and (e) a bargaining tariff. 

Johnson assumed that world prices of importables are given, that 
domestic production of importables is positive before and after im- 
position of the tariff, and that cross-elasticity effects can be neglected.” 
Using linear demand and supply curves and free-trade values for 
market parameters, Johnson computed the marginal cost of an ad 
valorem tariff (¢) to be precisely 


MC = tPe + tCn, (1) 


where P and C are the free-trade levels of domestic production and 
consumption, and € and 7 are the compensated own-price elasticities 


I would like to express my great appreciation to Professor Edward Tower for 
invaluable discussions and suggestions. 

' Strictly speaking, the use of tariffs for such purposes is not necessarily second best. It 
may be third best if the “noneconomic” objective implies a divergence in factor markets; 
or it may be first best if a trade divergence is involved (see Corden 1974, chap. 2). 

2 Units of output are assumed to be defined so that the world price is unity. This 
assumption, along with the others listed in the text, serves to simplify computations. 
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of supply and demand. (I have dropped Johnson’s subscript 7, indi- 
cating parameters for good i, in order to simplify the notation.) 

Given any specific tariff-related policy objective, the marginal 
benefits (MB) can be computed as a function of ¢ and the parameters 
P, C, €, and 7. Johnson’s methodology then involves setting the 
ratio MC/MB (call this Z) equal to unity and inferring the characteris- 
tics of the scientific tariff structure from the resulting expression.® 
However, the condition Z = 1 is neither necessary nor sufficient for 
determining the scientific tariff unless the prohibitive tariff (which 
satisfies Z = 1 only by accident) can somehow be ruled out of con- 
sideration as the second-best optimum. Furthermore, even excluding 
the prohibitive tariff from consideration, Z = 1 is sufficient only when 
it has a unique root in ¢, which will be true only for a special class of 
policy objectives (which define MB). Thus the scientific tariff can be 
inferred from the first-order conditions in only a limited set of 
circumstances—even if one adheres to the simplifying assumptions 
underlying Johnson’s analysis. 

These problems can be illustrated using Johnson’s own case of 
“self-sufficiency,” which he chose to measure by “the proportion of 
consumption supplied from domestic production.” This gives 


MB = B(€ + n)(1 —tny, (2) 
where is the policy weight attached to marginal units of the objec- 


tive, in numeraire units.4 Thus 


EMC USNC 
tpn als 


a - ny, (3) 
which clearly is cubic in ¢ and can therefore generate multiple roots® 
for the equation Z = 1. Note too that the prohibitive tariff (for which 
C = P) is at the level ¢, satisfying t, = (C — P)/(Cy + Pe). 

Figure 1 shows how MB and MC vary with t. Here MB(B,) is the 
marginal benefit line for a low value of B, in which case the scientific 
tariff (¢,; where Z = 1) is unambiguously preferred to the prohibitive 
tariff (¢,). For some policy weight B,, MB (B,) will satisfy the property 


* Specifically, from the total differential of Z one can determine the dt necessary to 
offset dX (X = e, n, P, or C), thereby characterizing the differences in ¢ as the market 
parameters vary across goods or over time. 

* I have introduced the parameter £ into Johnson’s model to make MB commensu- 
rate with MC, in order to clarify the discussion. This parameter is assumed to be 
constant, purely as a simplifying device. 

° Johnson, in his (1960) article, avoided confronting this problem by using a linear 
approximation to (3) in his analysis. This device is permissible if and only if one can 
presume that the scientific tariff will be small, but that presumption cannot generally be 
valid since the whole point of the exercise is to find the level for the scientific tariff. 
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Fie. I 


that the solid area equals the slashed area, so the policymaker is 
indifferent between a tariff of t,. (where Z = 1 given C, P, e, and n) 
and one which is prohibitive (> ¢,). For any 8 > B2 a prohibitive tariff 
will always be preferred. Thus a small increase in 8 may cause a finite 
jump in the scientific tariff.® 

Similarly, a small change in any of the market parameters may 
cause a finite jump in the scientific tariff (by altering MB, MC, and t,). 
For example, beginning with MC and MB(f;,) as shown in figure 1, it 
is clear from equation (3) that an increase in e€ will shift MB up relative 
to MC, causing the scientific tariff to jump from f, to ¢, (which itself is 
reduced in magnitude since dt,/de < 0). The possibility of such discon- 
tinuities is completely absent from Johnson’s well-behaved but inap- 
propriate formula. Furthermore, any further increase in € will reduce 
the scientific (i.e., prohibitive) tariff, contrary to Johnson’s conclusion 
that “the implied scientific tariff structure entails tariff rates which are 
higher the higher the elasticity of supply . . .” (p. 343). 

In short, Johnson’s methodology applies only to a limited set of 
circumstances; more generally, the first-order conditions are neither 
necessary nor sufficient for identifying the “scientific tariff.” 


6 For example, if 8 = 1,C = 10, P = 1, and y = € = 1, thent, = .82 while t.. < .33 
(since ts. must be to the left of the peak of the cubic Z function, which occurs at 1/3”). 
Thus a slight change in the planner’s preference for protection, or any of the market 
parameters, could conceivably cause the scientific tariff to leap from below 33 percent 
up to 82 percent. 
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Moreover, the structure of protection implied by the correct calcula- 
tion may differ dramatically from that implied by Johnson’s proce- 
dure. 
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Discount Rate and Wealth 


G. S. Laumas 


Illinois State University 


In a recent issue of this Journal, Mohabbat and Simos (1978) provided 
estimates of the rate of discount using Kendrick’s (1976) series of total 
private wealth. Since Kendrick also provides estimates of the division 
of total wealth between human, W,, and nonhuman, W,, wealth, it 
should be of interest to investigate how the discount rates differ 
between the two components of wealth. 

In order to accomplish this, it is necessary to divide the total income 
into two parts—one corresponding to the human wealth, Y;, and the 
other to the nonhuman wealth, Y,. Kendrick (1976, p. 22) and Chris- 
tensen and Jorgenson (1973) provide an outline of the method for 
such a division of the total income. Following them, the time series for 
Y, and Y, were estimated. The estimated series closely approximate 
the estimates for “labor compensation” and “property compensation” 
given by Kendrick and Christensen and Jorgenson. 

As suggested in Mohabbat and Simos, the time-series estimates of 
the rate of discount for W;, and W, were computed using the varying 
parameter regression technique. These series are given in table 1. 

The obvious inference that can be drawn from the results given in 
table 1 is that the discount rate on W,, is consistently higher than the 
rate on W,. This evidence is important by itself. In the absence of 
direct empirical estimates, as provided here, economists had to make 
assumptions about the behavior of the two rates (see Pesek and Saving 
1967). 

Additionally, since the major portion of W,, is education, the results 
provide further evidence on what Welch has called “one of the most 
important phenomena of our time . . . that the rates of return to 
investments in schooling have failed to decline under the pressure of 
rapidly rising average educational levels” (Welch 1970, p. 54). The 
existence of a relatively higher rate on W,, confirms the conclusions 
about the behavior of the rate of return on education arrived at by 
Griliches (1970, 1977), Welch (1970), Becker (1975), and others. 
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TABLE I 


Time-SERIES ESTIMATES OF THE DiscoUNT RATE ON HuMAN 
AND NONHUMAN WEALTH, UNITED SraTes, 1929 ro 1969 











Discount Rate on Discount Rate on 
Wieat Human Wealth Nonhuman Wealth 
1969 15 .08 
1968 15 .08 
1967 15 09 
1966 15 .09 
1965 15 .09 
1964 15 .09 
1963 15 .08 
1962 mG) .08 
1961 16 .08 
1960 .16 08 
1959 16 .08 
1958 pion .08 
1957 pled .08 
1956 ley .08 
1955 ales .08 
1954 my, .08 
1953 a7) .08 
1952 Bia .08 
1951 Phi .08 
1950 wid .08 
1949 7) .08 
1948 Pal .08 
1947 alla .08 
1946 18 08 
1945 20 .08 
1944 21 .08 
1943 mal .08 
1942 .20 08 
194] 18 .07 
1940 16 .08 
1939 16 .08 
1938 16 .07 
1937 16 07 
1936 IG .08 
1935 14 07 
1934 14 .06 
1933 a3 .06 
1932 13 05 
1931 .14 OW 
1930 15 08 
1929 16 -09 
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Money, Inflation, and the Bank of Canada: An Analysis of Canadian Monetary Policy 
from 1970 to Early 1975. By Tuomas J. CoURCHENE. 
Montreal: C. D. Howe Research Institute, 1976. Pp. 290. $5.00. 


For 12 years, from 1950 to 1962, while other industrialized countries were 
operating a fixed exchange-rate regime under the Bretton Woods par value 
system, Canada was a maverick floater that provided economists with a labo- 
ratory in which the practical problems of implementing monetary and fiscal 
policy under a regime of flexible exchange rates could be observed and 
analyzed. Prior to the advent of greater exchange-rate flexibility among the 
developed countries in the early 1970s, the Canadian experience was fre- 
quently cited as an example of the degree to which, through the judicious use 
of monetary policy, a flexible exchange rate could be kept stable with a 
minimum of official intervention in the foreign exchange market. It is no 
coincidence that two Canadians, Robert A. Mundell and Harry G. Johnson, 
were major contributors to the early literature on the appropriate use of 
monetary and fiscal policy under alternative exchange-rate regimes. 

During the late 1950s and early 1960s, debate on the relative merits of fixed 
versus floating exchange rates was waged within Canada not as an abstract 
intellectual exercise, but as a policy issue of immediate practical interest to all 
Canadians. Canada returned to the fold of fixed exchange-rate countries in 
1962 and remained there for 8 years. When she again chose to break from the 
par value system on June 1, 1970 she briefly appeared once more as the odd 
man out. By 1974, however, the accumulating pressures of widely divergent 
inflation rates among countries, continuing U.S. balance of payments deficits, 
and rapid increases in the prices of basic commodities and energy had led to 
the collapse of the par value system, leaving the larger industrial countries 
with little choice but to opt for the present system of “managed floating” 
exchange rates. In this context, the experience of the Bank of Canada in 
conducting monetary policy under alternative exchange-rate regimes is of 
considerable importance to students of monetary economics and policymak- 
ers alike. It is hardly surprising that, despite its relative youth (it was estab- 
lished in 1934), the Bank of Canada has been the subject not only of detailed 
research by academic economists, but also of lively and sometimes acrimoni- 
ous public debate. 

Thomas J. Courchene’s careful study of the Bank of Canada during the 
crucial and fascinating period 1970-75 is a welcome addition to a substantial 
literature which includes works by Neufeld (1955), Johnson and Winder 
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(1962), Reuber (1964), the Royal Commission on Banking and Finance 
(1964), and, perhaps the best known, Paul Wonnacott’s study The Canadian 
Dollar, 1948-1962 (1965). Like these earlier works, Courchene’s book at- 
tempts to analyze and explain the policies of the Bank of Canada within the 
context of standard concepts of macroeconomics and monetary theory. But 
Courchene’s book is far from being a mere update of earlier research. His 
work differs in at least two ways. First, he seeks to explain his views not only to 
other economists but also to interested laymen, a desire which is the natural 
reflection of the position occupied by the Bank of Canada at center stage in 
the policymaking theater. Second, he approaches the analysis of monetary 
policy from an avowedly “monetarist” perspective. These considerations gov- 
ern the structure of his work. 

The book is divided into four main sections. Part 1 is devoted to an 
elementary elaboration of several important aspects of the monetarist theory 
of a small open economy under fixed and flexible exchange rates. Courchene 
emphasizes the distinction between nominal and real stocks of money and 
describes the different forces that act to equate money demand and money 
supply under alternative exchange-rate regimes. He then goes on to describe 
the concept of the natural rate of unemployment and spells out its implica- 
tions for what monetary policy can and cannot do. Part | is completed by a 
brief description of alternative approaches to monetary theory, particularly 
the neo-Keynesian approach. 

Part 2 of the book treats the theory of central banking in general and the 
Bank of Canada’s approach in particular. Courchene first considers the vari- 
ous indicators of monetary stringency or ease that were used in Canada 
during the 1970—75 period—nominal interest rates, the liquid asset ratio of 
commercial banks, M1 growth, etc. He argues that the Bank of Canada’s 
choice of policy instruments during the period was a logical consequence of its 
theory of how the Canadian financial system operated and particularly of the 
role of market imperfections in allowing the authorities to influence both “the 
cost of credit” and “credit conditions” to some extent independently. 
Throughout, Courchene has chosen the technique of comparing actual Bank 
of Canada policies with what would have been ideal from the point of view of 
monetarist theory. Such an approach is, of course, consistent with his objec- 
tive, but it does involve compromises. Monetary economists of a neo- 
Keynesian bent will not be entirely satisfied with Courchene’s rather sparse 
treatment of the implications of the Keynesian model for monetary policy. 
Courchene rightly argues that his purpose is not to create a straw man out of 
the neo-Keynesian approach, but it is still left to the reader to decide whether 
policies that do not appear optimal when viewed from a strict monetarist 
perspective are more reasonable from the point of view of the model that the 
Canadian monetary authorities had in mind. 

Fortunately, this question turns out to be somewhat beside the point, for 
the real meat of Courchene’s book is to be found in parts 3 and 4, which 
present a very detailed description and analysis of Canadian monetary policy 
during 1970-75. After an initial chapter covering the whole period, Cour- 
chene devotes a chapter to developments during each year. This method of 
organization, while ideal for a reference work, makes for rather heavy going 
when the book is being read straight through because it results in excessive 
repetition. Nevertheless, Courchene’s achievement in these sections is impres- 
sive. He manages to present a careful and incisive analysis that integrates a 
great deal of statistical material on the economy’s performance with the 
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authorities’ own public explanations of their policies, drawn mainly but not 
exclusively from Bank of Canada publications. As a result, the comparison of 
actual policy with the monetarist paradigm becomes less important than the 
detailed description of events. 

Courchene’s thesis is that the floating of the Canadian dollar in June 1970 
was caused by the Bank of Canada’s attempt to pursue a less inflationary 
policy than the rest of the world, a policy that could only succeed if the 
exchange rate was flexible. But Courchene argues that “when the rate floated 
and Canada was finally allowed the independence to pursue its own policies 
with respect to the behavior of prices, the concern over inflation was immedi- 
ately jettisoned in favor of ensuring that the exchange rate was set at an 
‘appropriate’ level” (p. 161). This change in emphasis, in Courchene’s view, 
ushered in a period of excessive monetary expansion that persisted from 
1971 until 1975. All this is supported by the marshaling of an impressive 
array of facts and figures. One may choose to disagree with Courchene’s views 
about the monetary authorities’ motives and intentions, but it is not easy to 
dispute the broad outlines of his argument. Along the way, the reader is 
treated to numerous interesting details: the difficulties experienced by the 
authorities in using indicators and instruments based on the asset side of 
commercial bank balance sheets; the subtle web of problems arising out of the 
federal government's cash deficit and the use of Canada Savings Bond issues 
to finance its needs; and reliance on the transfer of government deposits, 
rather than standard open-market operations, to alter the level of bank 
reserves. 

On the basis of the evidence, few would dispute Courchene’s conclusion 
that rapid rates of monetary expansion fueled inflation in Canada in 
1971-75. But this criticism is somewhat tempered when the Bank of Canada’s 
policies are compared with the actual track record of central banks in other 
countries during the same period rather than with the monetarist paradigm. 
Furthermore, by late 1975 the Bank of Canada, like several other central 
banks, had announced its intention to place greater emphasis on quantitative 
monetary targets, a fact recognized by Courchene in a postscript apparently 
inserted as his book went to press. Whatever the importance of these shifts in 
the thrust of monetary policy, Courchene’s book will undoubtedly become a 
standard reference work for the 1970—75 period, and the thoroughness with 
which he has analyzed monetary policy in Canada will serve as a firm basis for 
the continuing debate on this subject. 


MALCOLM KNIGHT 
International Monetary Fund 
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A Life for Sound Money: Per Jacobsson. His Biography. By ERtn E. JACOBSSON. 
Oxford: Clarendon Press, 1979. Pp. xxv + 428. £16. 


Erin Jacobsson has written a lively and readable life of her father, covering the 
many facets of a remarkable career. She did not lack for material: 6 meters of 
correspondence; a diary covering over 50 years in such detail that for the later 
years it would fill several volumes; some 500 articles; innumerable speeches 
and reports, including the greater part of the Bank for International Settle- 
ments (BIS) Annual Reports for 25 years. This is drawn upon to paint a 
picture of the man and his many activities over the years, along with a 
necessarily brief account of the background to some of those activities. A 
concluding chapter deals with his economic thought. 

From the time he joined the League of Nations Secretariat in London in the 
spring of 1920, until his death in office as managing director of the IMF in 
1963, Per Jacobsson was almost continuously in the service of a key interna- 
tional organization. There was a break of 2% years in 1929-31 when he 
returned to Sweden. There for a time he was economic adviser to Ivar 
Kreuger, the Match King, not long before the latter’s suicide and the sensa- 
tional collapse of Kreuger and Toll. But even in those years he spent much of 
his time abroad on international business. Wherever he went as a wandering 
economic adviser he seemed to be in the thick of the action, with an extraor- 
dinary knack of positioning himself where he could best observe, analyze, and 
influence events. His influence was felt over nearly half a century, not only 
within the international organizations for which he worked, but among those 
responsible for economic and financial policy throughout the world. 

What was the secret of his influence as an economic adviser? Part of the 
answer lies in his personality. Always a little larger than life, he was the most 
clubbable of men, full of good stories, parables, and telling facts, persuasive 
and witty, confident in simple remedies and in his power to expound them, 
the world’s outstanding salesman of sound finance. A man so entertaining 
and heart warming was sure of a hearing. Added to that was an extremely 
well-stored mind: he took endless pains to master the factual background to 
financial problems, to know what was going on and put together what he 
could learn from others about past experience and current developments. It 
was not for nothing that he built up an excellent daily summary of financial 
news at the BIS and immersed himself so deeply in the preparation of its 
annual reports. When he had to suggest remedies he could start from a 
thorough diagnosis and take full account of special circumstances without 
limiting himself to purely general propositions. 

His approach was neither that of the economic forecaster nor that of the 
pure theorist. His concern was to make sure that the right policy instruments, 
especially monetary instruments, were brought into use and that they oper- 
ated in the right direction. Thus he was no enthusiast for fine tuning and was 
more given to laying emphasis on keeping costs under control than on the 
need for demand management. He never put together any extensive analysis 
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of the impact of monetary policy on output and prices but took for granted 
that it was at once the most powerful and most delicate of instruments 
available for maintaining internal and external balance. 

His use of economic theory was almost always at a comparatively elemen- 
tary level. But he could put his finger on the key issues and had a deep 
intuitive understanding of the complex interactions that govern economic 
development. He had courage, common sense, and conviction, so that he did 
not hesitate to repeat unpopular doctrine until it won acceptance. Above all, 
he had a very good track record. He had foreseen, well in advance, the fall in 
long-term interest rates in the thirties. He was never much impressed by the 
prediction of a prolonged dollar shortage and was one of the first to stress the 
growth in dollar liabilities. He also realized very early—certainly by 1942— 
that the postwar problem was going to be one of coping with inflation rather 
than with unemployment. The emphasis of what he had to say moved in- 
creasingly, therefore, with the temper of the times. 

The danger that he foresaw was that cheap money and deficit spending 
would continue in circumstances to which they were quite inappropriate. 
They would issue inevitably in excess demand, external deficits, and appeals 
for aid from abroad when policies designed to secure “internal balance” 
would make such aid superfluous. Such policies should be designed to restore 
monetary discipline, and it was on the monetary component that he kept 
harping until discount rates were increased in the early fifties. The message 
that he sought to emphasize was the primacy of monetary policy in maintain- 
ing price stability and the need simultaneously to limit public spending and 
preserve budgetary balance, if not year by year at least over a period of years. 
This was the point at which he took issue with Keynesian thinking, and it was 
also at the root of his popularity with central bankers. 

It happens also to be the point at which he and I parted company. He had 
no use for the Radcliffe Committee Report of which I was a signatory. He 
found convincing evidence of the importance of credit control in Germany’s 
experience in 1950-51, when an increase in the discount rate from 4 to 6 
percent was followed by a gradual but pronounced swing of the German 
balance of payments into surplus. But this still seems to me a special case in 
which, for obvious reasons, stock building of imported materials had been 
carried to excess, and it was only a question of time before the steep upward 
trend in German exports predominated over speculative excesses. In general, 
the early postwar period was dominated by the logistical difficulties involved 
in regaining prewar levels of production and trade, and cheap money was not 
a universally important handicap in the restoration of internal balance. 

However all that may be, there was no doubt about Jacobsson’s priorities. 
He put price stability first; and since he shared neither Keynes’s doubts about 
the self-regulating character of the economy nor Keynes’s optimism about 
managing the level of activity with success, he was content to let employment 
look after itself. In some respects, therefore, he came close to the monetarist 
position. But he disagreed strongly with Milton Friedman on monetary man- 
agement, insisting on the need for flexibility. He was also insistent on the need 
for what he called “a balance between prices and costs.” It is not altogether 
clear what he meant by this, since one might suppose that the relationship 
between the two is governed by the state of demand. It would seem that what 
he was after was the danger that profit margins might be compressed by 
excessive wage claims or misplaced tax policies and the need to take account 
of cost and supply factors before deciding on the timing or scale of any boost 
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to demand. He envisaged the use by governments of measures designed to 
influence costs directly, including what has come to be called “incomes pol- 
icy,” so that here again he was by no means in agreement with present-day 
monetarists. 

Although he was doubtful whether the influence of Keynes had been on 
balance healthy, his economic philosophy had much in common with 
Keynes’s. He was, like Keynes, a believer in management: both the manage- 
ment of money and the management of the economic system through finan- 
cial mechanisms. He was no believer in controls; but, like Keynes, he saw the 
need to find a way of stabilizing the wage level with the agreement of all 
parties concerned. Although he disliked deficit spending and public works, he 
was not prepared to condemn them in all circumstances. Even more than 
Keynes, he was essentially a nineteenth-century liberal, against nationalization 
and government subsidies, in favor of a market economy and free trade, 
championing the small against the large, private enterprise against govern- 
ment control. 

The ideas for which Jacobsson contended gained increasingly in influence 
over the postwar world. Yet he never succeeded in committing them system- 
atically to paper in the form of a treatise. This he regretted deeply. His 
compulsive need to talk—‘“the daily spreading of the substance of my 
gifts’—took precedence over the need to construct a theoretical scaffolding 
for his ideas. But there is no doubt that he made the right choice and that his 
forte lay in economic policymaking rather than in theoretical virtuosity. He 
left behind in his diary and his writings an ample record of the events in 
which he took part and of the ideas governing his approach to them. 


ALEC CAIRNCROSS 
Oxford 


Keynes’ Monetary Thought: A Study of Its Development. By Don PATINKIN. 
Durham, N.C.: Duke University Press, 1976. Pp. 155. $9.75. 


It is a truly amazing story. He was a renowned teacher, scholar, and writer, an 
influential commentator on world affairs, an expert always consulted by his 
government, an intellectual active in partisan politics, a successful operator in 
business and finance, and much more. At the age of 47, he had just published 
his magnum opus, a two-volume treatise setting forth his mature conclusions on 
the entire range of his scientific expertise. The eagerly awaited work excited 
great interest in his profession, but considerable criticism too. The author 
himself was dissatisfied. Even before the ink was dry, he was changing his 
mind and working on the sequel. This task became a corporate enterprise, 
engaging not only the critical wisdom of his peers but the loyal enthusiasm of 
one of the most remarkable concentrations of talented young scholars in 
academic history. Seminars and lectures quickly turned from glosses on the 
completed work to draft chapters of the new. Over 5 years of constant 
conversation and correspondence, the master, his young disciples, and 
selected critics, friendly and not so friendly, argued out the new doctrines. He 
and his disciples never doubted they were at last figuring out how the world 
really works, solving puzzles that had stumped their predecessors for a cen- 
tury. They did not rely on experimental findings, statistical inference, histori- 
cal study, or mathematical innovation, but on sheer logic and insight. Their 
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parochial confidence that truths worth knowing arose inside a circle of a 
40-mile radius would have been ridiculous arrogance had it not been so 
nearly justified. Finally, the new book appeared and wrought the revolution 
its makers anticipated, not only commanding the attention of the profession 
worldwide for decades to come, but profoundly influencing the policies and 
politics of nations. 

The two books were, of course, Keynes’s Treatise on Money (1930) and his 
General Theory (1936). The publication of The Collected Writings of John M aynard 
Keynes by the Royal Economic Society makes available many details of the 
story, and more are accessible in unpublished documents among the Keynes 
Papers in the Marshall Library at Cambridge. It is the good fortune of the 
profession that a great monetary theorist, Don Patinkin, has undertaken 
active research in the history of monetary theory and in particular on Keynes. 
In the brief volume under review he traces Keynes's monetary and mac- 
roeconomic thought through his three major works, beginning with the Tract 
on Monetary Reform (1923), stressing the story sketched above of the two later 
books. Keynes’s multiple roles in science and public life make for an engross- 
ing case study of the interactions of theoretical developments with the events, 
problems, and policy controversies of the times. The intellectual and personal 
relationships of Keynes to his older colleagues (Pigou, Robertson, Hawtrey, 
Hayek), to his young disciples at Cambridge (Kahn, Joan Robinson, Austin 
Robinson, Sraffa) and at Oxford (Harrod, Meade), and to others (including 
Kaldor, Lerner, Myrdal, Ohlin, Hicks) are a fascinating episode in the sociol- 
ogy of scholarship. Patinkin’s concise narratives whet the appetite, and he 
refers us to works by Winch, Moggridge, and Howson for full meals. 

Here Patinkin’s main emphasis is on theory. He clearly explains and com- 
pares propositions of the three books and related articles, and he seeks 
reasons for the changes. Most readers will be particularly interested in Patin- 
kin’s views of what Keynes “really meant” in the General Theory. With admira- 
ble restraint, Patinkin refrains from interpreting or appraising Keynes from 
the hindsight of today’s theory and knowledge, even his own. 

The Tract Patinkin finds fascinating on the national and international 
problems and policies of the era, but in monetary economics he finds it a 
routine recital of the Cambridge version of the quantity theory. The Treatise 
he finds dull and mechanical—“a Keynes out of character, a Keynes attempt- 
ing to act the role of a Professor, and a Germanic one at that” (p. 24). The 
famous Fundamental Equations are fundamentally flawed, and the objective 
of explaining output fluctuations is bound to elude a theory that is designed 
to explain prices and profits for given output but lacks equations for output 
determination. It is this gap that led Keynes to the theory of effective de- 
mand, in Patinkin’s view the central insight of the General Theory. For the 
General Theory and for Keynes’s lifetime contribution Patinkin is a great 
enthusiast. In these days, when Keynes’s standing with the profession and 
with the general public is cyclically low, when the man’s writings and influence 
are blamed for all the world’s ills, students and young economists will gain 
valuable perspective from Patinkin’s book. 

On the main points of interpretation I agree with Patinkin. (1) The preten- 
sion to establish an equilibrium with involuntary unemployment, which puts 
off so many economists instilled with neoclassical instincts, should not be 
taken seriously. Keynes is really describing an economy in disequilibrium, or 
in a succession of constrained temporary equilibria, using comparative statics 
as the analytical language of the day. His strong conviction and his true 
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message are that the automatic self-righting mechanisms of whole economies 
are slow, weak, and unreliable. Excess supply can persist for long periods, 
whether one calls the situation equilibrium or not. (2) Keynes did not assume 
either money wage rigidity or money illusion. He did contend that wages 
would adjust downward slowly—more slowly than prices could move upward. 
And he was skeptical of the employment effects of economy-wide wage 
reduction. As Patinkin says, he regarded wage and price deflation as the 
equivalent of monetary expansion. Neither might be capable of lowering 
interest rates enough to revive investment in depressed times. (3) Keynes 
emphasizes nonpolicy disturbances to the economy, real as well as monetary. 
The “state of long-term expectation” by entrepreneurs and investors is a 
major determinant of effective demand. Since these expectations relate to 
essentially unknowable and nonprobabilistic future events, including the ex- 
pectations of future entrepreneurs and investors, they are arbitrary and 
volatile. Keynes also stresses, as in the Treatise, the importance of differences 
of view between savers and investors, or bulls and bears, or lenders and 
borrowers. (4) The Hicksian IS/LM apparatus captures beautifully the 
general equilibrium character of Keynes’s vision, but its concise formalism 
does not do justice to the above insights and others. (In early drafts of the 
book and in lectures Keynes wrote symbols for expectations or “state of the 
news” as arguments of functions, but they were not in the book for Hicks to 
find.) 

My differences from Patinkin’s observations are minor. (1) He describes (p. 
81) the financial side of the General Theory as a three-asset model (money, 
bonds, equities), in contrast to the two-asset model (money, equities) of the 
Treatise. This seems to be stretching things. It seems to me that the General 
Theory really has only two assets (money, everything else). Anyway, there is 
only one endogenous interest rate, that to which the estimated marginal 
efficiency of capital is to be equated by variation of investment—a stock-flow 
confusion, as Lerner observed early on. In treating the portfolio choice 
between equities and fixed-money-value assets and in considering the way the 
market brings together “bulls” and “bears,” I think the Treatise is superior, 
dealing with more interesting and important matters than speculation about 
bond interest rates. (2) Patinkin notes with favor (p. 36) that the Treatise 
abandoned the quantity-theory presumption, followed in the Tract, that mon- 
etary measures first alter the quantity of money and only then, via this 
medium, affect spending, profits, prices, and output. In the Treatise central 
bank operations change interest rates and thereby affect all the variables, 
including the quantity of money, simultaneously. The General Theory appears 
to be a step backward in this respect. (3) Patinkin allows himself to wonder (p. 
110) about Keynes’s omission of wealth, in distinction from capital gains, as a 
determinant of consumption. Keynes in fact does recognize life-cycle effects, 
and he would properly not wish to count planned or expected accumulations 
of savings as another determinant. But why does he not recognize real gains 
or losses on money balances? The answer is not clear. Perhaps it is because 
none of Keynes's theoretical work contemplates drastic or continuing one-way 
price change. As in other monetary theory of the day, the quaesitum was the 
determination of the equilibrium value of money, that is, the price level, not 
the inflation rate or any other dynamic price path. Given this mind-set, 
Keynes might naturally regard short-run price variation as too transient and 
reversible to enter consumers’ reckoning of wealth. The same mind-set must 
be the reason that Keynes did not see inflation as a way of lowering real 
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interest rates even when nominal rates were close to their floor. (4) Greater 
stress might be placed on Keynes’s uncritical acceptance of the neoclassical 
competitive model. By assuming that firms are price takers in auction markets 
rather than price setters in monopolistic competition or oligopoly, he made it 
harder to sustain his vision of persistent disequilibrium, with failures of 
coordination, communication, and adjustment. Imperfect competition was 
the other revolution in economics in the 1930s; one of its sites was Keynes’s 
Cambridge, and two of its agents, Joan Robinson and Sraffa, were in his 
group. Yet for some mysterious reason the two revolutions were never 
meshed. 

Keynes planned, Patinkin reports, to write a book of “Footnotes to the 
General Theory,” but was never able to do so. The fourth book, Patinkin feels, 
would have been one of continuation and clarification of the third. Keynes 
had no reason to rethink the General Theory as he had the Treatise. 

Which of the several Keynesian traditions today is authentic? Any or all? 
Patinkin does not enter this dangerous terrain. We can hope he will tell us on 
some other occasion. Meanwhile we can look forward to more books that 
combine, as this one does, fascinating narrative, economic history, history of 
thought, and lucid exposition of difficult theory. 


JAMES TOBIN 
Yale University 


Studies in Macroeconomic Theory. Volume 1: Employment and Inflation. By E>mMunD 


S. PHELPS. 
New York: Academic Press, 1979. Pp. xii + 418. $19.50. 


This volume reprints 17 papers by Phelps that were originally published 
between 1965 and 1978, together with one previously unpublished paper 
from 1959 and some newly written material in the form of seven brief 
commentaries on the origins and aims of the papers and a synthetic intro- 
ductory essay laying out a theory of price, wage, and employment fluctua- 
tions. Phelps’s work is deep, insightful, and technically challenging. Many of 
the reprinted papers are well known and have been influential in the rapid 
development of macroeconomic analysis in recent years. 

My candidate for the most significant paper in the volume is “Phillips 
Curves, Inflation Expectations, and Optimal Employment over Time” (1967). 
In this paper Phelps introduced the natural-rate hypothesis and the associ- 
ated acceleration hypothesis into macroeconomic analysis, although he mod- 
estly reports that he learned these ideas from his mentors William Fellner and 
Henry Wallich. The natural-rate hypothesis probably has proven to be the 
most important new idea in macroeconomics since 1936. 

Milton Friedman’s apparently independent invention of the natural-rate 
hypothesis in his 1967 presidential address to the American Economic Associ- 
ation represents a remarkable coincidence in the history of the subject. In- 
terestingly, however, Phelps and Friedman in their original presentations 
drew radically different lessons from the natural-rate hypothesis. For Fried- 
man, the natural-rate hypothesis provided another reason for believing that 
activist stabilization policy offers little potential benefit to offset its inherent 
risks. For Phelps, the natural-rate hypothesis provided a new and welcomed 
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challenge to the technical virtuosity of students of optimal control techniques, 
a reaction that reflects an essential part of his implied view, discussed further 
below, of the relevance of economic analysis for economic policy. 

The contents of this volume mostly are concerned with two problems: the 
development of positive models of economic behavior that can account for 
observed macroeconomic fluctuations, and the development of normative 
criteria for macroeconomic policy based on a dynamic theory of optimal 
inflation. With regard to the former problem, the newly written introductory 
essay attempts to consolidate a theory of macroeconomic fluctuations that, as 
Phelps puts it, is “developed piecemeal” in the reprinted papers. 

In my view, this essay fails to confirm Phelps’s claim that there now exists a 
satisfactory theory of employment fluctuations that explains why nominal 
disturbances apparently create a temporary divergence between actual and 
expected inflation rates and thereby affect real variables in addition to prices 
and wages. Recent years have seen progress, to which Phelps’s work has 
contributed, in clarifying the objectives for such a theory, but have not seen 
the attainment of these objectives by either Phelps or other theorists. Both of 
the main existing approaches to this problem—the non-market-clearing 
models, exemplified in this volume by the introductory essay and the paper 
written jointly with John Taylor, “Stabilizing Powers of Monetary Policy 
under Rational Expectations” (1977), and the incomplete-information mod- 
els, developed most fully by Robert Lucas and Robert Barro—seem to rely 
either explicitly or implicitly on contrived and unconvincing impediments to 
Walrasian outcomes. 

In Phelps’s new work, specifically, the key assertion is that costs of informa- 
tion transmission cause delays in market-clearing price and wage adjustments. 
This idea, however, receives neither formal theoretical development nor 
empirical confirmation. Phelps’s substantial contributions to the ongoing 
work on a theory of macroeconomic fluctuations actually have largely in- 
volved qualification and embellishment of the essential clarification embodied 
in the natural-rate hypothesis—namely, that a Phillips curve consistent with 
neoclassical economic postulates would relate divergencies of unemployment 
from its equilibrium, that is, “natural,” level to the difference between the 
inflation rate and some measure of the expected inflation rate, rather than 
simply to the inflation rate. 

One accomplishment of the newly written material is to clear up am- 
biguities and inconsistencies in some of the reprinted papers with regard to 
market-clearing assumptions. As indicated above, Phelps now prefers the 
non-market-clearing approach, and the new introductory essay, in contrast to 
some of his earlier work, assumes that both labor and product markets fail to 
clear. His main defense of the non-market-clearing approach is that he 
regards the alternative incomplete-information approach as unrealistic, be- 
cause when combined with assumptions about rational expectations it implies 
that only unsystematic components of monetary policy affect real variables. 
Phelps, however, discusses no empirical evidence that supports rejection of 
this hypothesis. Actually, the existing empirical analysis seems to be inconclu- 
sive. Moreover, here as elsewhere, his discussion of the existing literature is 
impressionistic and does not include either detailed examination of alterna- 
tive models or even specific references. 

Phelps also refers to “the patent unrealism of the view [implicit in the 
incomplete-information approach] that unemployment is wholly voluntary” 
(p. 27). However, as many authors, including myself, have pointed out, this 
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criticism is either semantic or wrong. Specifically, the recently developed view 
of labor market transactions as involving implicit contractual arrangements, 
which Phelps acknowledges to be a useful contribution, explains layoffs and 
other apparent evidence of nonwage rationing of employment without in- 
voking a failure of labor markets to clear. 

Phelps alludes in a few places to what seems to me to be the most telling and 
fundamental problem with the incomplete-information approach, incorpo- 
rating rational expectations and market clearing, to modeling macroeconomic 
fluctuations. This problem is that the assumptions of this theory apparently 
imply that perceived monetary disturbances would not affect real variables, a 
hypothesis that seems inconsistent with the apparent relation between pub- 
lished monetary data and measures of output and employment. To get 
around this problem, the Lucas and Barro formulations of this theory im- 
plicitly employ the contrivance that rational individuals disregard current 
monetary data. 

With regard to the implicitly contractual view of labor market transactions 
referred to above, Phelps emphasizes a variant, developed in a short paper 
written jointly with Guillermo Calvo, “Employment Contingent Wage Con- 
tracts” (1977), that assumes that employers know more about realized states of 
nature than do their workers. The significance of this assumption, however, 
seems questionable because, contrary to what Calvo and Phelps assume, 
employers in competitive labor markets probably would not find it worthwhile 
to exploit such an informational advantage. In any event, even with the 
Calvo-Phelps assumption about firm behavior, the contractual model does not 
imply that employment fluctuations should be related to published monetary 
data. Unfortunately, Calvo and Phelps obscure this point by implicitly intro- 
ducing another questionable assumption, that the monetary authority has 
superior information about real and velocity shocks. 

In addition to macroeconomic fluctuations, the other main focus of this 
volume is, as indicated above, a theory of optimal inflation. Phelps is in- 
terested in prescribing inflation policy both in the context of a deterministic 
steady state and in stochastic and non-steady-state contexts in which optimal 
control techniques become relevant. The analysis of these issues is com- 
prehensive and introduces many subtleties and complexities. Examples of 
fundamental insights are the viewing of the problem of optimal inflation as 
involving the choice of an optimal tax mix and as being parallel to the 
problem of optimal growth. Importantly, Phelps clearly distinguishes and 
explains the respective roles of empirical knowledge and ethical postulates in 
prescribing policy. 

It is worth recalling in this context that Phelps rejects models of mac- 
roeconomic fluctuations that assume market clearing and rational expecta- 
tions. Such models imply that feedback control is irrelevant. In his more 
recent papers, however, Phelps takes careful account of how the introduction 
of rational expectations in place of adaptive expectations in non-market- 
clearing models radically changes the control problem. In “Disinflation with- 
out Recession: Adaptive Guideposts and Monetary Policy” (1978), he devises 
a subtle counterinflationary plan involving “a dynamic monetary program which 
is calculated to maintain full employment insofar as the average wage ap- 
proximates its warranted path, and the institution of some form of indicative 
wage planning to guide and promote the growth of wages along that warranted 
path” (p. 256). In advocating this plan, he also carefully assesses the questions 
about economic behavior that make its success less than certain. 
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Phelps’s analysis of optimal macroeconomic policy involves challenging 
intellectual exercise and undoubtedly enhances understanding of important 
economic relations. His discussions of policy issues, however, both in this 
volume and in other writings, also illustrate the confusion prevailing among 
us academic economists about the relevance of normative economic analysis 
for actual policymaking in real economies. In his critique, “The 1972 Report 
of the President’s Council of Economic Advisers [CEA]: Economics and 
Government,” published in the September 1972 issue of the American Eco- 
nomic Review (vol. 62, pp. 533-39) and referred to but not reprinted in the 
present volume, Phelps observes in a tone of anger and frustration “that 
economic analysis does not occupy a strategic place in the selection of eco- 
nomic policy” (p. 538). But he does not seem to be willing to entertain the 
possibility that the explanations for this observation may be systematic and 
immutable. 

In criticizing the CEA, Phelps describes the so-called game plan of 1969-71 
as involving both deception and self-deception, and he suggests that these 
symptoms are chronic characteristics of governmental inability to make effec- 
tive use of scientific knowledge. In “Stopover Monetarism: Supply and De- 
mand Factors in the 1972-74 Inflation” (1975), he attributes the record of 
poor monetary policy to inadequate quantitative knowledge about the econ- 
omy. In his 1972 book, Inflation Policy and Unemployment Theory (New York: 
Norton), he writes of “the hope, however unwarranted, of [the cost-benefit 
approach to inflation planning] having an early policy impact” (p. 1x). 
Moreover, with regard to alternative inflation policies, he acknowledges “the 
extraordinary difficulties of estimating costs and benefits” (p. xvii), as well as 
the fear that “if the lid is taken off the inflation question . . . the public will 
simply bungle its inflation decisions” (p. xv). At the end of this discussion, 
however, he succumbs to apparently wishful thinking when he writes, “I 
believe there could be a significant gain from adopting a less ritualistic, more 
‘calculating’ approach to inflation policy” (p. xvi) and finally offers the advice 
to sink or swim when he writes, “Getting hold of our democratic policy- 
making potentialities is as much the point as the modest though valuable 
economic gain that can be expected to result” (p. xvi). 

It is interesting that, in “Inflation Planning Reconsidered” (1978), Phelps 
discusses precautionary considerations that take account of the political in- 
ability to take appropriate action. But, he also displays unrealistic, ivory-tower 
faith in the importance of logic and ideas. For example, he apparently 
regards such constructs as “the social utility function,” “the Rawlsian 
maximum criterion,” and “the intersection of weighted individual prefer- 
ences and technically feasible choices” to be potentially operational and to 
provide a meaningful basis for actual policymaking. In a newly written com- 
mentary, he admits to being amazed that the natural-rate hypothesis, a 
straightforward implication of neoclassical theorizing, turned out to be so 
controversial. In “Commodity-Supply Shock and Full-Employment Monetary 
Policy” (1978), he fancifully attributes a decision not to increase the money 
stock to accommodate supply shocks to Federal Reserve acceptance in 1974 of 
the latest, and at that date still unpublished, theoretical work implying that 
nonaccommodation is harmless. 

Phelps clearly does not appreciate the correct perception of many laymen 
that much professional economic advice is based on academic fads. In the 
newly written material he states, “It may be that, as a methodological device, 
the postulate of rational expectations will prove to be irresistible—whatever 
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our doubts over its substantive reality” (p. 28). But does he really expect the 
public to take seriously prescriptions based on nothing more than profes- 
sional prejudice? And how are laymen to distinguish solid advice from un- 
tested hypotheses? My guess is that attempts to apply the latest intellectual 
speculation to policy questions are subversive to efforts to enhance the role of 
fundamental economic analysis in policymaking. 
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Interpreting Economic Time Series 











Thomas J. Sargent 


Unwersity of Minnesota and Federal Reserve Bank of Minneapolis 


This paper explores some of the implications for econometric prac- 
tice of the principle that people’s observed behavior will change 
when their constraints change. In dynamic contexts, a proper 
definition of people’s constraints includes among them laws of mo- 
tion that describe the evolution of the taxes they must pay and the 
prices of the goods that they buy and sell. Changes in agents’ per- 
ceptions of these laws of motion (or constraints) will in general 
produce changes in the schedules that describe the choices they 
make as a function of the information that they possess. Until very 
recently, received dynamic econometric practice ignored this princi- 
ple. The practice of dynamic econometrics should be changed so that 
it is consistent with the principle that people’s rules of choice are 
influenced by their constraints. This is a substantial undertaking and 
involves major adjustments in the ways that we formulate, estimate, 
and simulate econometric models. 


Introduction 


This paper explores some of the implications for econometric practice 
of a single principle from economic theory. This principle is that 
people’s observed behavior will change when their constraints change. 
In dynamic contexts, a proper definition of people’s constraints in- 


The views expressed here are solely mine and do not necessarily represent the views 
of the Federal Reserve Bank of Minneapolis or the Federal Reserve System. Many of 
my thoughts on the subject of this paper have been heavily influenced by numerous 
discussions with’ Lars Peter Hansen and Robert E. Lucas, Jr. The observations on 
Bayesian methods are in large part those of Hansen. Ian Bain made many helpful 
comments on an earlier draft. This paper is the text for the Mary Elizabeth Morgan 
Prize lecture, given at the University of Chicago in November 1979. 


[Journal of Political Economy, 1981, vol. 89, no. 2] 
© 1981 by The University of Chicago. 0022-3808/81/8902-0007$01.50 


213 


214 JOURNAL OF POLITICAL ECONOMY 


cludes among them laws of motion that describe the evolution of the 
taxes they must pay and the prices of the goods that they buy and sell. 
Changes in agents’ perceptions of these laws of motion (or con- 
straints) will in general produce changes in the schedules that describe 
the choices they make as a function of the information that they 
possess. Until very recently, received dynamic econometric practice 
ignored this principle and routinely deduced policy conclusions by 
assuming that people’s rules of choice would not vary, for example, 
with the government’s choices of laws of motion for variables such as 
tax rates, government purchases, and so on. These variables are 
supposed to have their effects precisely because they influence the 
constraints of some private agents. 

The practice of dynamic econometrics should be changed so that it 
is consistent with the principle that people’s rules of choice are 
influenced by their constraints. This is a substantial undertaking and 
involves major adjustments in the ways that we formulate, estimate, 
and simulate econometric models. Foremost, we need a stricter 
definition of the class of parameters that can be regarded as “struc- 
tural.” The body of doctrine associated with the “simultaneous 
equations” model in econometrics properly directs the attention of the 
researcher beyond reduced-form parameters to the parameters of 
“structural equations,” which presumably describe those aspects of 
the behavior of people that prevail across a range of hypothetical 
environments. Estimates of the parameters of structural equations are 
needed in order to analyze an interesting class of policy interventions. 
Most often, however, included in a prominent way among the 
“structural equations” have been equations describing the rules of 
choice for private agents. Consumption functions, investment sched- 
ules, demand functions for assets, and agricultural supply functions 
are all examples of such rules of choice. In dynamic settings, regard- 
ing the parameters of these rules of choice as structural or invariant 
under interventions violates our simple principle from economic 
theory. 

This paper describes methods for interpreting economic time series 
in a manner consistent with the principle that people’s constraints 
influence their behavior. For the most part, I shall restrict things so 
that the dynamic economic theory is of the equilibrium variety, with 
optimizing agents.and cleared markets. However, many of the princi- 
ples described here will pertain to other types of dynamic economic 
theories, such as “disequilibrium” models with optimizing agents. The 
line of work I shall describe has diverse antecedents, of which major 
ones are contributions of Muth (1960, 1961), Nerlove (1967), Lucas 
and Prescott (1971), Telser and Graves (1971), and Lucas (1972b, 
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1976). The works of Granger (1969) and Sims (1972) have provided 
key technical econometric foundations. 

The basic idea is to interpret a collection of economic time series as 
resulting from the choices of private agents interacting in markets 
assumed to be organized along well-specified lines. The private agents 
are assumed to face nontrivial dynamic and stochastic optimization 
problems. This is an attractive assumption because the solutions of 
such problems are known to imply that the chosen variables (e.g., 
stocks of factors of production or financial assets) can exhibit serial 
correlation and cross-serial correlation. Since time series of economic 
data usually have the properties of high own-serial correlation and 
various patterns of cross-serial correlation, it seems that there is po- 
tential for specifying dynamic preferences, technologies, constraints, 
and rules of the market game that roughly reproduce the serial 
correlation and cross-correlation patterns in a given collection of time 
series measuring market outcomes. If this can be done in such a 
fashion that the free parameters of preferences, technologies, and 
constraints are identifiable econometrically, it is then possible to 
interpret the collection of time series as the outcome of a well- 
specified dynamic, stochastic equilibrium model. This paper is in- 
tended as a nontechnical summary of some of the econometric and 
theoretical issues involved in interpreting data in this way. 

But why should anybody want to interpret time-series data as rep- 
resenting the results of interactions of private agents’ optimizing 
choices? The answer is not that this way of modeling is aesthetically 
pleasing, although it is, nor that modeling in this way guarantees an 
analysis that implies no role for government intervention, which it 
does not. The reason for interpreting time series in this way is practi- 
cal: potentially it offers the analyst the ability to predict how agents’ 
behavior and the random behavior of market-determined variables 
will each change when there are policy interventions or other changes 
in the environment that alter some of the agents’ dynamic constraints. 
There is a general presumption that private agents’ behavior and the 
random behavior of market outcomes both will change whenever 
agents’ constraints change, as when policy interventions or other 
changes in the environment occur. The most that can be hoped for is 
that the parameters of agents’ preferences and technologies will not 
change in the face of such changes in the environment. If the dynamic 


1 Examples of work in the general line are Holt et al. (1960), Craine (1975), Crawford 
(1975), Geweke (1977), Sargent (1977, 1978), Blanco (1978), Hansen and Sargent 
(1979, 1980a, 19806), Taylor (1979, 1980), Huntzinger (1979), Kennan (1979), Meese 
(1979), and Nerlove, Grether, and Carvalho (1979). The philosophy of this work is 
reviewed by Lucas and Sargent (1978, 1980). 
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econometric model is formulated explicitly in terms of the parameters 
of preferences, technologies, and constraints, it will in principle be 
possible for the analyst to predict the effects on observed behavior of 
changes in the stochastic environment. 

Past dynamic econometric studies should usually be regarded as 
having been directed at providing ways of summarizing the observed 
behavior of interrelated variables, without attempting to infer the 
objectives, opportunities, and constraints of the agents whose deci- 
sions determine those variables. Most existing studies can be viewed, 
at best, as having estimated parameters of agents’ decision rules for 
setting chosen variables as functions of the information they possess. 
Most of the better studies of consumption, investment, asset demand, 
and agricultural supply functions must be interpreted as having 
estimated such decision rules. Dynamic economic theory implies that 
these decision rules cannot be expected to remain invariant in the face 
of policy interventions that take the form of changes in some of the 
constraints facing agents. This means that there is a theoretical pre- 
sumption that historical econometric estimates of such decision rules 
will provide poor predictions about behavior in a hypothetically new 
environment. This was Lucas’s (1976) critique of econometric policy 
evaluation procedures as they existed in 1973. 

Some readers of Lucas (1976) have interpreted the message as a call 
to evaluate policies by using existing econometric models differently? 
However, one implication of Lucas’s argument, and of dynamic eco- 
nomic theory generally, is that the formulation, identification, and 
estimation of the models must each be approached in substantially 
new and different ways. Most existing models simply cannot be saved 
by simulating them a little more shrewdly. 


* The papers by Anderson (1979) and Mishkin (1979) seem at least partly motivated 
by this interpretation. 

* The set of ideas I discuss in this paper has perhaps received most notoriety in the 
context of macroeconomic examples. In particular, substantial attention has been 
devoted to the sample economies of Lucas (1972a) and Sargent and Wallace-(1975) in 
which those systematic nonneutralities that come from imputing persistently subopti- 
mal expectations to agents were shown to disappear when the hypothesis of rational 
expectations was imposed on agents. Crudely put, certain classes of systematic mone- 
tary policies, in particular those which operate solely via deception, were rendered 
impotent in the Lucas and Sargent and Wallace examples. Since the publication of these 
papers, many papers have been published that have described setups in which the 
choice of systematic policy matters, even when rational expectations prevail. These 
papers usually invoke a source of nonneutrality not based on deception, of which there 
are many in standard macroeconomic theory. Papers of this class have often been 
interpreted as providing a defense of “pre-rational expectations” activist policies along 
lines that were produced by calculating optimal controls for Keynesian econometric 
models of the style of the late 1960s. In fact, no such defense is implied, partly because 
the methods by which optimal controls for government policy variables are calculated 
are very different in all rational expectations models from the procedures that were 
applied to pre-rational expectations models, but also because the ways in which 
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Formulating and estimating “rational expectations” models and 
dynamic equilibrium models of economic time series involves a variety 
of important conceptual and econometric issues, some of which I try 
to summarize in this paper. Among the issues to be treated are the 
following: 

1) Identification criteria. —Prior identifying information of the Cowles 
Commission variety, that is, mainly exclusion restrictions, plays a 
much smaller role in dynamic equilibrium models. Nonlinear cross- 
equation restrictions implied by dynamic theory are used extensively. 
This shift involves important modifications of past ways of thinking 
about identification and estimation. 

ii) Models of error terms. —The dynamic equilibrium modeling strat- 
egy virtually forces the researcher to think about the sources and 
interpretations of the error terms in the stochastic equations that he 
fits. The explicitly stochastic nature of the theorizing makes it difficult 
to “tack on” error terms after the theorizing is done, a usual proce- 
dure in the past. 

iil) The role of Granger causality. —Granger causality turns out to be a 
critical concept in the formulation of dynamic economic models, as it 
is coincident with the condition for the appearance as an information 
variable in an agent’s decision rule of a variable not otherwise in the 
agent’s criterion function or constraints. 

iv) Bayesian analysis. —Bayesian econometric techniques provide a 
means of mixing prior theoretical information about parameters with 
information from the data. Such procedures are widely used by 
applied time-series econometricians, although often no formal Bayes- 
ian justification is given. Dynamic economic theory provides no 
justification for one widely imposed class of prior restrictions which 
can be viewed as restrictions directly on decision rules. Instead, 
dynamic economic theory suggests that prior information about 
agents’ criterion functions and constraints is what should be used in 
estimation. This feature of dynamic economic theory has implications 
for the proper implementation both of formal Bayesian procedures 
and of less formal procedures for constraining parameter estimates. 

I shall organize my discussion around an example, namely, a 


econometric estimates are to be constructed for rational expectations models, with or 
without neutralities, differ substantially from the methods applied to the Keynesian 
models of the 1960s. The main point of the Lucas (1972a) and Sargent and Wallace 
(1975) examples is that substituting the assumption of rational expectations for 
“adaptive” expectations makes a critical difference for the methods both by which we 
should evaluate and optimally choose government policies. That same message is pres- 
ent in the papers of Fischer (1977), Phelps and Taylor (1977), and Hall (1978), even if 
superficially the differences in some qualitative features of the optimal policies under 
the two assumptions on expectations may have seemed less dramatic than in Sargent 


and Wallace’s example or Lucas’s. 
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linear-quadratic version of Lucas and Prescott’s (1971) model of in- 
vestment under uncertainty. I shall use this example for discussing 
the econometric implications of dynamic equilibrium models. I have 
adopted a linear-quadratic setup because it simplifies both the 
theoretical and econometric discussions, while illustrating many of the 
salient methodological implications of dynamic decision theory. 
Linear-quadratic optimum problems deliver difference equations that 
are linear in the variables and so match up nicely with much existing 
dynamic econometric theory. The reader familiar with Lucas and 
Prescott (1971) will recognize how the example can be generalized to 
incorporate more general specifications for the technologies, prefer- 
ences, and constraints. That increased generality would make the 
econometric implications harder to extract than with the present setup, 
without altering the basic message.* 


Investment under Uncertainty 


This paper describes a linear-quadratic version of Lucas and Pres- 
cott’s model of investment and uses it as a vehicle for expositing a 
variety of conceptual and econometric issues. The model describes 
the mutual determination over time of the capital stock, output, and 
market price of a single industry. The model can be generalized to 
handle multiple factors of production at the cost of what are really 
only technical complications. Similarly, the model could also be 
generalized to incorporate a set of industries, like the corn and hog 
industries, with interacting dynamics. Finally, I mention that it is 
straightforward to modify the model to incorporate much richer 
dynamics by generalizing the nature of the adjustment costs. 


* Using the methods of discounted dynamic programming (e.g., Blackwell 1965), 
theoretical results establishing existence and uniqueness of equilibria and various 
qualitative features of the equilibria can often be obtained for “weak” or “general” 
assumptions, such as that utility is concave, constraint sets are convex and monotone in 
shift variables, and so on. Lucas and Prescott (1971) and Lucas (1978) give interesting 
illustrations of these methods. These techniques were also used by Sargent (1980b) to 
make some general observations on interpreting time-series correlations between To- 
bin’s q variable and the aggregate rate of investment. However, for applied work, it is 
necessary to be able to calculate equilibria as a function of the free parameters of 
preferences and constraints, and it is highly desirable if the equilibria can be calculated 
easily. While for general functional forms it is in principle possible to calculate equilib- 
ria of recursive competitive models using a contraction mapping, in practice such 
methods are presently too expensive to use in empirical work. For this reason, for 
empirical work it is presently necessary to choose functional forms for which equilibria 
can be calculated either analytically or very quickly. Linear-quadratic specifications are 
one of the few such choices of convenient functional forms available. (Various versions 
of logarithmic specification are also sometimes tractable, e.g., Merton [1971].) A valu- 
able treatment of recursive competitive equilibrium models with general specifications 
of functional forms is Prescott and Mehra (1980). 
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I define the following variables: 


Me ATF 
n — 
y= 
po 
Dy= 


Dy, = 


D,= 


tp 
W: 


Ut 
Er 
ky 
K; = 


output of the representative firm; 
number of firms in the industry, assumed constant over time; 
ny, = total output of industry; 
price of output; 
a (p; X 1) vector of random variables appearing in the indus- 
try demand schedule, p, = 1; 
a (p — pi) X 1 vector of random variables which help predict 
future values of the collection of variables D,,, p = /1; 
ee } 
Do; ; 
rental rate on capital; 
a (q X 1) vector whose first element is w;; the remaining 
elements of W, are variables that help predict future w/’s; 
a random shock to demand; 
a random shock in the production function; 
stock of capital of the representative firm; and 
nk, = total capital stock in industry. 


The subscript t indexes the date to which the variable corresponds. 
I further define the following polynomials in the lag operator L: 


B2Ga) ea liccte Swlts 
j=1 


where 6,, is a scalar; 


TD 
Be Lu 0 ld, 
j=1 


where 6p; is ap X p matrix and J, is the p X p identity matrix; 


Tw 
Sgt) =le—> ial, 
j=1 


where 6,,; is a (gq X q) matrix and J, is the (q X q) identity matrix; and 


S.(LY=1— > 8.L4, 
j=1 


where 6,; is a scalar.® 
The industry consists of n identical competitive firms, each of which 
uses a single factor of production, capital, to produce a single output. 


> [ shall impose the condition that the zeroes of 6,(z), 6,(z), det 5p(z) and det 5,,(z) each 
exceed unity in modulus. Actually, a weaker condition would suffice, namely, that the 
zeroes of these polynomials each exceed VB in modulus, where B is the discount factor 
introduced below. These conditions on the zeroes are regularity conditions that assure 
that the infinite series calculated in eqq. (14) and (19) converge. 
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Output of the representative firm y, is governed by 


ye = fky + ney, Sas (1) 
where k, is the representative firm’s stock of capital at t, and e, is a 
random error in the technology. The firm knows {€;, €:-1, - - - Out 


does not know with certainty future values of the shock e;. The error 
€, is known to follow the r,th-order Markov process 


d.(L)e, = Vi, (2) 


where ’§ is a “fundamental” white-noise error term for e;.° The firm is 
assumed to know 6,(L) and E(V§)* with certainty. 
The demand curve for output is given by’ 


DA gee Aa tee) pial, Ags Ane. (3) 


where D,, is a (p, X 1) vector of “demand shifters,” A, is a (1 X py) 
vector of constants, and u, is a random shock to the demand curve. 
The random term u, obeys the r,th-order Markov process 


where V’¥ is a fundamental white noise for u,. The (p; x 1) vector of 
demand shifters D,, consists of the first p, rows of the p x 1 vector D,, 
which follows the r)th-order vector autoregressive process 


$5 (L)De= VP, (5) 


where V’? is a (p X 1) vector white noise that is fundamental for the 

process D,. The representative firm is assumed to know 6,(L), 6p(L), 

Ay, Ay, Ay, and the second moments of V% and V’? with certainty. 
At time f¢, total output is given by 


Y,=ny, =/K, 7 €;- (6) 


The representative firm’s problem is to choose a contingency plan 
for k,,; to maximize the criterion 


E i B' Por — weky — (bess * ka (7) 
t=0 


6 An (n X 1) vector white noise v? is said to be fundamental for an (n X 1) vector 
process x, if the vector of one-step-ahead linear least-squares errors in predicting x; 
from past x’s can be written as a linear combination of n components of v7. 

7 Since a simple static demand function is posited, all of the interesting dynamics of 
the model come from its supply side. Specifying a demand schedule with interesting 
dynamics would complicate the presentation but not alter the basic messages of our 
example. Telser and Graves (1971) analyze dynamic optimization problems in which 
much of the interesting dynamics come from a demand curve that is specified. Sargent 
(1979, chap. 16) analyzes a model of the labor market in which the dynamics are 
influenced by nontrivial dynamic optimization problems solved by both suppliers and 
demanders. 
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subject to ko given. In (7), E; is the mathematical expectation operator, 
conditional on information known to the firm at time t. This informa- 
tion set will shortly be specified precisely. In (7), d is a positive con- 
stant. The term (d/2)(k;,; — k;)” is intended to represent the notion 
that there are costs internal to the firm of adjusting the capital stock 
and that these rise at an increasing rate with the absolute value of the 
change in capital. We assume that the rental on capital w, is the first 
element of the (q x 1) vector random process W, that obeys the 
r,th-order vector autoregression 


8 o(L)Wi = V?, (8) 


where V? is a (q X 1) vector white noise that is fundamental for W;. 
The firm is supposed to know 6,,(L) and the second-moment matrix 
of V? with certainty. 

At time ¢, the firm chooses k;,,, given the information that it has 
available at ¢. However, the maximization problem (7) is not yet well 
posed, since we have not completely spelled out the dynamic con- 
straints with respect to which the maximization is supposed to occur. 
To complete the problem (7), we begin by substituting (fk; + n7'e;) 
for y,, and (Ay — A,fK; — A,€; + AzDy + uz) for Py to get 


Eo PAC (Ao ae A,fK; 7 A,€; AD ote ur) (fRe sta n-+€;) 
t=0 


1 (9) 
= sky — oi Cop ky)? . 


In order that the problem of maximizing (9) with respect to a 
contingency plan for {k,4;} be well posed, it is necessary to attribute to 
the firm precise views about the laws of motion of the random vari- 
ables that it cannot control, but whose values influence the best choice 
of its own stocks of capital. For problem (9), these uncontrollable 
variables about which the representative firm cares are K;, Dy, u;, €, 
and w;. The firm cares about the present and future behavior of the 
variables (K;, D4, us, €:) because they influence the present and future 
behavior of the market price through the demand relationship P; = 
Ay — AifK; — Aye; + AgDy, + UW. The firm cares about the evolution of 
the rental process w; because it influences its costs. We have already 
completely described our assumptions about the firm’s views of the 
laws of motion of Dy, u;, €;, and w;, namely, that the firm knows the 
Markov laws (4), (5), (2), and (8) that govern them, and at time ¢t knows 
DED Bie el 8 he See oe tees AOU gs WV 15:24 9 Lo.complete 
the specification requires that we specify the firm’s views about the 
evolution of the aggregate capital stock K,. We assume that the repre- 
sentative firm believes that the aggregate capital stock evolves ac- 
cording to the law 
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Kis = Ho + Hy (L)Wi + Ay(L)D, + AL) G (10) 
+H Eup A he 


where H, and H, are scalars and 
ryt 


Heg( Le) =o tl onlays WHEE TT ogi 1a G3 
j=0 


Thm 
Hy(L) = > HyL, — where Hp; is (1 x p); 
5=0 
ren 
Cee Dy Aol where H,, is a scalar; and 
5=0 
ry-1 
Ay (LyY= S, Age, where H,,; is a scalar. 
5=0 
The representative firm is assumed to know all of the parameters of 
the linear law of motion (10) with certainty. The reason that we have 
chosen the form (10) as the firm’s perceived law of motion for K will 
shortly become apparent. 
With these specifications, the maximization of (9) is now well posed. 
Summarizing the setup, we have that the representative firm 
maximizes 


Eo wh BE (Ao a A,fK; = Ai& =f AsD 4 aa ur) fhe ae n~‘e,) 


t=0 


— wiki — $ (hess - hy) ® 
subject to the laws of motion® 

Kyi= Hy PAW ees. DP mde: th 

1 (er 
Sw(L)Wr = V?, (8) 
du(L)ur = Vi, (4) 
dp(L)Di = V?, (5) 
d.(L je, = Vi, (2) 


and subject to the information set at time ¢,? 


* It would be straightforward to modify this setup to assume that the {W, u, e, D} 
processes are each finite order mixed moving average, autoregressive processes. For 
the details, see Hansen and Sargent (1979). 

* These variables completely characterize the “state” vector for the firm’s problem. 
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The firm maximizes (9), taking the laws of motion (8), (4), (5), (2), and 
(10) as given and beyond its control. The firm is assumed to behave 
competitively and to act as if it has no control over the aggregate 
capital stock K. This is a reasonable assumption if n is large. The firm 
is assumed to know the 6’s and H’s with certainty and to know the first 
and second moments of the V/’s.!° We further restrict the problem so 
that the solution is a linear contingency plan." For this to be true, it is 
sufficient that the least-squares predictors of future W, D, €, and u’s be 
linear in the conditioning variables. This will be true if V§, V%, V2, and 
V? obey normal probability laws. Alternatively, the analyst can simply 
assume that the industry is operating under optimal linear rules. In 
either case, the solution of the representative firm’s problem is a 
linear contingency plan of the form! 


Resy = hy + hy(L)W; + hp(L)D; + he (L) € 
=f hy (L )uy + hyK; Sa hoki, 


(11) 


where ho, A,, and hz are scalars, and 


Ty-l 


igh) = Bs) etl where hy; is (1 X q); 
5=0 


pri 


hp(L) = > hol}, where Ap; is (1 x p); 
j=0 


Te-1 


h(L)= > hight, where h,; is a scalar; and 


Ty-1 


h,(L) = » hgld where h,, is a scalar. 
j=0 


The h’s of (11) are in general functions both of the parameters in the 
criterion function (9), the parameters of 6,, 5,, 5p, and 6, appearing in 


We have in mind that the firm actually has observations on values of K,k, W,D, €, andu 
for all dates ¢ and earlier. It turns out that the firm’s decisions are optimally a function 
only of the information set listed in the text. 

10 It is assumed that each of V?, V7, V?, and V¢ is orthogonal to the information set 
{Wis Ut—s, Des, Erg, § 2 3 : 

11 This is because we want the stochastic deteyehce equations describing the behavior 
of the system to be linear and thereby to be readily susceptible to econometric analysis. 

12 That the solution to the problem is of this form follows from linear optimal control 
theory (see Kushner 1971, chap. 9; Bertsekas 1976, chap. 3; Sargent 1979, chap. 14; 
or Kwakernaak and Sivan 1972). 
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(8), (4), (5), and (2), and the H’s of the perceived law of motion for 
capital (10). The mapping giving the h’s as functions of these other 
parameters is defined implicitly by standard formulas in linear opti- 
mal control theory, as exposited, for example, by Kwakernaak and 
Sivan (1972) and Bertsekas (1976). For present purposes, it is enough 
to note the existence of this mapping without exploring its nature in 
detail. The economic content of the mapping from the 6’s, H’s, and 
objective function parameters to the h parameters of the firm’s deci- 
sion rule is easy to understand, since it captures the notion that the 
firm’s rule of choice depends on both its objective and its perceived 
constraints (10), (8), (4), (5), and (2). 

Multiplying both sides of the firm’s decision rule (11) by n and using 
K, = nk; gives 


Ky, = nny + nh, (L)W, + nhp(L )D, + nh (Le, 


1 
+ nh, (L)u, + (nhy + ho) Ke. (Ta) 


Equation (12) is the actual law of motion for aggregate capital that 
results from the behavior of the representative firm. The represen- 
tative firm’s optimization problem in effect induces a mapping from 
the firm’s perceived law of motion for aggregate capital (10) to the 
actual law of motion (12). For each possible particular perceived law 
of motion of the form (10), there is an implied law of motion for 
aggregate capital of the form (12). The notion of rational expectations 
is that the representative firm’s perceptions of (10) are correct. In 
effect, a rational expectations equilibrium is a fixed point of the 
mapping that the representative firm’s optimization problem induces 
from (10) to (12). Formally, we define a rational expectations equilibrium 
as a perceived law of motion (10) and an implied actual law of motion 
(12) which are identically equal. In a rational expectations equilib- 
rium, firms’ perceptions about the law of motion for aggregate capital 
turn out to be confirmed by the aggregate of the choices made by 
firms. Upon comparing (10) with (12) it is evident that necessary and 
sufficient conditions for a rational expectations equilibrium are 


Hy = nho, 
H,(L) = nh,(L), 
H)(L) = nhp(L), 
H(L) = nh,(L), 
H,(L) = nh,(L), 
Hy, = (nh, + ho). 


Implicit in the above definition of a rational expectations equilibrium 
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are the following elements: (a) market clearing, (6) optimization of the 
firm’s expected present value, and (c) correct perceptions on the part 
of firms of the laws of motion of variables affecting their present value 
but beyond their control. 

We begin our analysis of the model by briefly describing aspects of 
the optimization problem solved by the firm. Among the first-order 
necessary conditions for the maximization of (9) is the following 
system of stochastic “Euler equations,” which are derived by differ- 
entiating (9) with respect to k, fort = 1,2,...: 


Bdk,., = d(1 a B)ky ss dk, — Bu; 
— Bf (Ao - A fK; — Aye, + A2Dy + uz), 


(13) 


or 


bos (Gt Ike + Fhe = ur - Fp, 
In addition to the system of Euler equations, a transversality condition 
is among the first-order necessary conditions. The transversality con- 
dition can be derived by methods described in Sargent (1979). The 
transversality condition for the present problem in effect requires 
that the solution possess the property 


lim EB Res; = 0. 
jn 


Using the lag operator, the preceding Euler equation can be re- 
written as’? 


E a ae 5 I)L Se a thee — Swi = io 


Using the factorization 


1 1 1 
Des liar 2? | (1 Se \(i= 1) 
B B B 
the above Euler equation can be written as 


1 
(1- GLa - kar = pw - FP, 
Noting that [1 — (1/B)L] = —B-'L(1 — BL“) and operating on both 
sides of the above equation with [—B4L(1 — BL™~')]~! gives the 
solution’* : 


8 For a discussion of the use of lag operators in the present context, see Sargent 


(1979, chaps. 9 and 14). 
4 Tn effect, the transversality condition compels us to solve the unstable root forward 


in this manner. 
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(1 — L)Res = bese ba lenGies te 


or, equivalently, 


(1 — Lyk: = —d-18 Y Blower + Bd > BPs (14) 
i=0 i=0 


It can be verified that (14) satisfies both the Euler equations and the 
transversality condition. Equation (14) would give the appropriate 
rule for setting k,4, if the firm had perfect foresight about the entire 
future paths of the rental w, and the output price P;. When the firm 
does not have perfect foresight, the correct decision rule can be 
derived by replacing the future values on the right side of (14) with 
the corresponding mathematical expectations conditional on infor- 
mation the firm does have. This leads to the decision rule” 


CL iii Oa 8 Se BE Weiss | OQ, + Bid Ds BOEP print | ,.., (15) 
i=0 i—0 


Here (, is defined as the information set 0, = {W,, W.-4, . . . . Up, Ui-1, 
2. Dp Dir,» - + 5 & €t-1) » « » » Ky}. The conditional mathematical 
expectations are assumed to be computed using the laws of motion 
(10), (8), (4), (5), and (2) for K, W, u, D, and e, respectively, as well as 
the demand relationship P, = Ay) — 4, (fK; + €&,) + A.D + 4, which 1s 
used to deduce the law of motion for P,. Once these conditional 
mathematical expectations are explicitly calculated in terms of the 
parameters of (10), (8), (4), (5), (2), and the demand curve (3), they 
can be substituted into equation (15) to deduce the optimum decision 
rule (11) for the representative firm. The decision rule (11) is linear in 
all of the information variables that appear on the right side. How- 
ever, as the above method of calculating the parameters h of the 
decision rule (11) suggests, the parameters h are themselves compli- 
cated nonlinear functions of the underlying parameters of the model: 
the parameters Ay, 4,, A, of the demand curve, the parameters f and d 
of the technology, and the parameters 6,(L), 5.(L), 5,,(L), and 5p(L) 
of the laws of motion of the random processes given from outside the 
model.'® The h’s are also nonlinear functions of the H’s of the law of 


‘5 As noted above, we shall want the relevant conditional expectations to be linear. So 
we shall regard the E(- | 0,) that appears in (15) and elsewhere as wide-sense condi- 
tional expectations, that is, linear least-squares predictors. This amounts to restricting 
the firm to linear decision rules, as desired. 

16 The parameters 8 and n also belong in this list of underlying parameters of the 
model. I shall usually delete these two parameters from subsequent listings of the 
model’s underlying parameters, though they should be understood. In some applica- 
tions, the analyst may want to specify counterparts of 8 and n completely a priori, in 
which case they would not be included among the free parameters of the model over 
which the likelihood function or other measure of “fit” is to be maximized. 
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motion of aggregate capital (10), which are not given from outside but 
are to be determined from the analysis. The nature of these non- 
linearities has been characterized by Hansen and Sargent (1980b) and 
will be alluded to further below. 

Equation (15), which was derived by purely formal manipulations, 
has the virtue of indicating clearly that the firm has an incentive to 
forecast future realizations of the rental w and the output price P. As 
a result, any state variables that the firm sees at ¢, and that help predict 
either future P’s or future w’s, will appear in the firm’s decision rule 
for k,,,, given by equation (11). That the h’s of (11) are nonlinear 
functions of the parameters {Ao, A,, Ao, f, d, B, By, Se Sp» 8p, Hy, Hes Hy, 
Hy, Hy, and H,} stems from the nonlinear way in which the condi- 
tional mathematical expectations of future w’s and P’s are functions of 
these parameters. 

In practice, to compute a rational expectations equilibrium it is not 
necessary ever to calculate the right side of (15). Indeed, it is never 
necessary explicitly to calculate the h’s that determine the decision 
rule (11) of the representative firm. Instead, the H’s of the equilib- 
rium law of motion for the industry can be calculated directly as 
follows.’” First, multiply both sides of equation (13) by n, then use K, = 
nk, and collect all terms in K on the left side to get 


BdK,,, — [d(1 + B) + 41f? BnlK, + dK,_, = Bru, 


16 
— BnfA, + A, Bfne — BfnA,Dy — Bfnrm. sa 


It is of some interest that (16) is itself the Euler equation for the “social 
planning” problem of maximizing’ 


Eo SB AWK ef) SAK FG, ears (fK; + €,)A Diy 
aot (17) 
1 


9 WA Kye = Ky), 


= (fK, or cu =i K A= 


17 The following argument in the text provides a way of discovering Lucas and 
Prescott’s (1971) method of calculating the rational expectations equilibrium by for- 
mulating a fictitious social planning problem that reproduces the equilibrium. It is 
worth remarking that Kydland and Prescott (1977) describe a recursive method of 
calculating a linear rational expectations equilibrium that is applicable to our problem 
and is distinct from the Lucas-Prescott method upon which the discussion in the text is 
based. Kydland and Prescott’s method successfully computes the equilibrium even in 
instances in which the Lucas-Prescott method breaks down. These instances occur, e.g., 
in which there is feedback from the industry-wide aggregate capital stock K to W or D, 
as would occur if lagged K’s appeared as states in the Markov law for W or D. In such 
instances, Lucas and Prescott’s social planning problem fails to reproduce the rational 
expectations equilibrium essentially because the fictitious planner takes into account the 
externality that the feedback from K to W or D constitutes. 

18 This was emphasized in a more general context by Lucas and Prescott (1971). 
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subject to the laws of motion (8), (4), (5), and (2) for w;,, Uz, D4, and e,'9 
and subject to Ky given. 
The term in brackets is the area under the demand curve, since 


Y 
(A, - AGX Se AD, a U,)dx = ny a SAY ar Y,A,Dy in Y ay. 
0 
Thus (17) is the discounted area under the demand curve minus the 
total costs of production. Dividing each side of (16) by Bd, the Euler 


equation can be written 


Kia ae (} ca dy ate Ayfin 1 =F lia 


B d p 
=Ry,- he + AP (18) 


a d-‘jnA,D,; ia i Ut. 





It can easily be proved that there exists a such that 
[1-1 +B + AyfPnd- Lb + BOL?) = (1 — (AB) L]0 — AL), 


where | \| < 1/V/B.?° Using — (A8)~'L(1 — ABL~!) = [1 — (ABy"!L], we 
have that the Euler equation (18) can be written as 





= QB)AL(L = ABL“N(L = ALK = 9 + 2, 


+A. - 2 4p,- eu, 


A solution of the Euler equation that also satisfies the transversality 
condition for the social planning problem is 


niB Bae 
aT Nero 





(lis AD) Kei Eno (1 — Ag)? - 


A,fnXr Le ndBd-"L~! 
ee ¥ eae oe it 
fndBd-1L- 
A ame 


Recall, for example, that (1 — ABL~!)~1w; = Yio (AB)*wy4j. Then it can 
be recognized that equation (19) is the perfect foresight solution of 


It can also be proved that the transversality condition for (17) imposes the same 
condition on the solution as does the transversality condition of the representative firm. 

° This follows directly from the observation that if zo is a zero of [1 — (1 + Bu! + 
Aly ndm: 2a Saez Rune O SONS I Ozgue 
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the planning problem that the rational expectations competitive equi- 
librium implicitly solves. Thus, equation (19) expresses the aggregate 
capital stock K,4; as a linear function of K;, and all future values of w;,, 
€,, Dy, and u;. 

By using the methods of Hansen and Sargent (1980b, esp. appendix 
A), equation (19) can be converted to the “realizable” law for K that 
satisfies the Euler equations and transversality conditions, and which 
expresses K,,,; as a function only of information known at time ¢.”! 
This involves replacing the terms w;4;, €44;, Dyz4i, and U;4; in (19) by the 
corresponding mathematical expectations conditioned on Q,. The 
resulting equilibrium law of motion for K can be shown to be 


Key — Hot (LW ap (b)DPr alle 





(20) 
+ H,(L)u, + H,K;,, 
where 
_ +ABnfAy 
tte Te AB)’ 
A,=h%, 
ae A Ll — O,\AB)n on(L)| 
H,,(L) a d Pw emntee mada 
rs lA Bo (L 
Ee eee ca Eee 
(21) 
Hy(L) = +fndBd Andy [aE oh eed | 
Sli jie= “ r EOF iB 
H,(L) = +d7¥fndB [ee 


Here ¢,, isa 1 X g vector with 1 in the first position, followed by (q — 
1) zeroes, and ¢p is ap, X p matrix with a (p, X p,) identity matrix as 
the first p, columns and zeroes elsewhere. Notice that w, = $,,W; and 


Dy, = bpD,. It is convenient at this point to recall the laws of motion 
assumed for w;, u;,, €, and D,,, namely, 
5(L)W, =V?, (8) 
5,(L)u, = Vi, (4) 


21 Note that eq. (19) satisfies the first-order necessary conditions for the optimization 
problem but gives the planner too much information (it is “anticipative” or “nonrealiz- 
able”). The correct solution to the problem taking the information set available to the 
planner into account is the solution of the first-order necessary conditions that expresses 
Kj4, as a function only of information that the planner possesses at time t. Such a 
solution is said to be “realizable” or “nonanticipative.” 
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dp(L)D; = V?, (5) 
b-(L)e, = Vi. (2) 


Equation (20) expresses the equilibrium law for the industry-wide 
capital stock K,,, as a linear function of K, and current and past values 
of W, D, e, and u. Current and past values of W appear in (20) because 
they help predict future values of the rental rate w,, while current and 
past values of D, e€, and u appear because they are used by agents to 
predict the future course of the market price P. The numbers of 
lagged values of W, D, e€, and u in (20) arer,, — 1,7) — 1,7. — 1, andr, 
— 1, as expressions (21) can be used to show.”? Thus, the numbers of 
lagged values of these “information variables” W, D, e, and u in (20) 
are entirely inherited from the specifications of the actual laws of 
motion for W, D, e, and wu in (8), (5), (2), and (4). 

Notice that the appearance of w, and D,, in the objective function of 
the representative firm (9) (or equivalently in the objective function of 
the fictitious social planner [17]) gives rise to the appearance in (20) of 
the entire blocks of variables W, and D, that help predict w and D,,, 
respectively. Thus any variables that help predict w and D,,, and 
which agents have information on, belong in the equilibrium law of 
motion for industry-wide capital. The property that the remaining 
variables in W (or D) help predict future values of w (or D,) is said to 
be the property that the remaining variables in W (or D ) Granger cause 
w (or D,). The notion of Granger causality thus turns out to be 
coincident with the criterion for whether random variables that do 
not themselves appear in the agent’s criterion function nevertheless 
end up in the equilibrium law of motion or decision rule, essentially 
because they appear in the agents’ constraints as information variables 
that help predict variables that do appear in the criterion function. It 
is mainly for this reason that the concept of Granger causality has 
played an important role in work with rational expectations models.?3 


* By expanding the polynomial in L, it is possible to show that 
LAU = 8(48)8(L)] y 


= 6(XB)"} N, ic —J—1 TA 3 
ea Ves (8) {S >, p) a | (22) 
where 8(L) =I — 3; _, 8,L’. Notice that the polynomial on the left side of (22) is one sided 
in nonnegative powers of L, despite the appearance of L~', and that it is a polynomial of 
order (r — 1), as asserted in the text. The formula (22) can be derived by mimicking the 
procedures used in Hansen and Sargent (19802). The same mathematical techniques 
used by Hansen and Sargent (1980a) to derive expressions like (21) or (22) were 
independently utilized by Futia (1979) to compute linear rational expectations equilib- 
ria. Also, without knowing of Hansen and Sargent’s work, John Kennan independently 
derived formulas similar to (22) in a personal letter to me. 

° From this point of view, it is irrelevant whether Granger causality is consistent with 
one’s notion of what “true” causality is. Sims (1972) has described the relationship of the 
concept of Granger causality to that of strict econometric exogeneity. That a random 
process y fail to Granger cause x is a necessary condition for x to be strictly econometri- 
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Equations (20) and (21) reveal explicitly how the parameters of the 
equilibrium law of motion for industry-wide capital are themselves 
nonlinear functions of the underlying parameters { A), 4,, Asef pa, BG. 
n, 6,(L), 6,(L), 6.(L), &p(L)}. The nonlinearity has two sources. First, 
there is the fact that \ is a nonlinear function of B and (A, f?’nd-) via 
the factorization defining \, [1 — (AB)“1L](1 — AL) =[1-(1+ p+ 
A,f?nd—)L + B-!L?]. Second, given \, the formulas for H,,(L), H,(L), 
A,(L), and Hp(L) in (21) are nonlinear in the parameters of 6,,(L), 
6,(L), 6.(L), and 5p(L). Nonlinear cross-equation restrictions of the 
kind illustrated by (20) and (21) are the hallmark of rational expecta- 
tions models. Such cross-equation restrictions are largely absent from 
“pre-rational expectations” dynamic econometric models.24 The 
presence of these restrictions impinges on a variety of fundamental 
econometric and conceptual issues, including identification, the 
analysis of interventions, models of “error terms,” and the role of 
“prior information.” I now turn to discussing each of these issues, 
using (20) and (21) as an instrument. 


Analysis of Interventions 


At this point, it is useful to remind ourselves of the principal reason 
that an economist might want to construct a dynamic econometric 
model of an industry along the lines of our example. It is to be able to 
make quantitative predictions about the effects on the industry that 
various hypothetical interventions or “changes in the environment” 
will have. In the present context, a hypothetical “intervention” or 
“change in the environment” means a change in one of the polyno- 
mials 6,(L), 5,(L), 5p(L), or 6.(L) that describe, respectively, the 
stochastic processes for W, u, D, and € that impinge on the market.?® 
Several interesting examples of such interventions can be given, in- 
cluding the following: 

a) Suppose that there is a specific tax imposed on sales of the 


cally exogenous with respect to y. For this reason, the concept of Granger causality is 
also useful in designing specification tests. For a discussion of the relationship between 
Granger causality and econometric exogeneity in the context of linear rational expecta- 
tions models, see Hansen and Sargent (1980a). 

*4 Thus Fisher wrote: “In practice, except for such covariance restrictions [across 
disturbances in distinct structural equations], restrictions which relate the parameters 
of one equation to those of one or more others are extremely rare. There is no reason 
in principle why such cases cannot occur, however, and it may be worthwhile devoting a 
very short discussion to them” (1966, p. 176). 

*5 By now, this is a routine and uncontroversial definition of an intervention. Appli- 
cations of the techniques of optimal control theory to the calculation of mac- 
roeconometric and microeconometric policy response functions employ precisely this 
concept of intervention (see, e.g., Chow 1973; Kareken, Muench, and Wallace 1973; 
Arzac and Wilkinson 1979; and Taylor 1979). 
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product. Such a specific tax can be modeled as a component of 
(A,D,,). Since the behavior of the tax through time will be described 
by an element of the vector Markov law 6;(L)D; = V?, changes in the 
rule for setting the specific tax amount to changes in one of the rows 
of 6)(L). 

b) Suppose that there is a specific tax on the use of the factor of 
production. This tax can be modeled as an addition to the rental w;. A 
change in the rule for setting this tax can be modeled as a change ina 
row of 6,,(L). 

c) Suppose there is a change in the structure of the process gov- 
erning the “pretax” part of the rental. Again this can be modeled as a 
change in one row of 6,,(L). With a little imagination, the effects of a 
change in the organization of the industry”® supplying the factor 
might be modeled in this way. 

The model leading to (20) and (21) provides a way of predicting 
quantitatively the effects of such changes, once agents have caught on 
to them. The effect of interventions in the sense described here is to 
change the function (20) describing the evolution of industry capital 
in a way predicted by the formulas given in (21). Since interventions 
of this class change the law of motion (20), it is necessary to have 
analytic methods which use the cross-equation restrictions (21) to 
predict how the H’s of the K-law of motion (20) will change if there is 
a hypothetical intervention operating on one or more of the &’s. 

In order to evaluate policy interventions in this way, it is essential 
that the H’s of (20) should not be viewed as being among the free 
parameters of the model. Instead, the model’s free parameters are to 
be regarded as the deeper parameters {Ao, 4,, Ao, f, d, Sy, Sys Se, Opty: 
The researcher needs to know these parameters in order to be able to 
use the formulas (21) to predict the consequences of hypothetical 
changes in the functions 8.27 

From the dynamic economic theory leading to (20) and (21), it is 


*° E.g., if it becomes a cartel when before it had been competitive or noncooperative 
in some way. 

*7 A technical qualification needs to be added at this point. In order to have a model 
capable of predicting effects of interventions acting on the 8’s, one can sometimes get by 
without having uniquely identified the parameters {A, f, d}. What the researcher must 
identify are the parameters of the characteristic polynomial of the Euler equation (16), 
namely the parameters do, ¢, in {Ad — [d(1 + B) + Ayf*nBIL + dL} = (Bd, + dol + 
$L”), where $, = d and. —¢, = [d(1 + B) + A,f?nB]. The theory assumes that the d's will 
be invariant with respect to interventions on the 8’s. If the researcher can uniquely 
identify the @;'s, he can proceed with econometric policy evaluation, even if he cannot 
uniquely identify all of (A,, f, d). In some setups, the parameters of the characteristic 
polynomials of the Euler equations are identified even though only an equivalence class 
of the counterparts of (A,, f, d) is identified. This is enough for econometric policy 
evaluation to proceed. This problem is discussed by Hansen and Sargent (19808). It is 
technically related to the “inverse optimal control” problem (see Mosca and Zappa 
1979). 
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evident that a given numerical version of (20), estimated from histori- 
cal data, cannot be used to evaluate the consequences of arbitrary 
input sequences for {W,}, {D,}, {e,}, and {u,}. That is, a fixed law of 
motion of the form (20) with given numerical values for the H’s 
cannot be used to investigate the consequence of arbitrarily specified 
numerical sequences for the W, D, e, and w’s. In effect, a particular 
version of (20) can be expected to hold up only for W, D, e, and u 
sequences drawn from a restricted domain: namely, sequences obey- 
ing the probability laws (8), (4), (5), and (2).78 

However, until Lucas wrote in 1976, evaluating the effects of inter- 
ventions in this inappropriate way was the accepted procedure in both 
the macroeconometric and the microeconometric literatures. Re- 
grettably, to this day it remains the procedure used in the over- 
whelming majority of analyses of policy interventions. It should be 
emphasized once again that from the viewpoint of the dynamic deci- 
sion theory described above, the question of how agents will respond 
to “arbitrary sequences” of “forcing variables” W, u, e, and D is not 
well posed. In effect, unless the researcher specifies precisely the 
perceived laws of motion for the “forcing variables,” he has not 
specified the constraints subject to which decision makers are thought 
to be acting.”9 

Thus, in order to be able to evaluate interventions operating on the 
8’s, it is necessary to formulate and estimate the model in terms of the 
parameters of preferences (Ao, Ay, A2), technology (f and d), and the 
constraints (the 6’s). The argument in favor of formulating and es- 
timating the dynamic model at the level of the deep parameters {Ao, 
A,, Ag, d, f, 8, 5y, 5, 5p} is in much the same spirit as the usual 
justification for estimating “structural” parameters rather than 
reduced-form parameters. As Marschak (1953) argued, the re- 
searcher wants to estimate those objects which will permit him to 
analyze an interesting class of changes in the environment. Dynamic 
economic theory has forced us to reexamine whether objects long 
thought to be “structural,” including the parameters of decision rules 
such as consumption, investment, and portfolio balance schedules, 
are correctly taken to be invariant with respect to changes in the 
environment. Once agents’ behavior is modeled in terms of genuinely 
dynamic optimization problems, it becomes apparent that the param- 
eters of observed decision rules should not be viewed as structural (see 


28 This message is at least implicit in the Work by Lucas and Prescott (1971). Gordon 
and Hynes (1970) made the argument in an informal way. Lucas (1976) forcefully 
brought the message to the attention of macroeconomists. 

29 However, some economists continue to argue that existing macroeconometric 
models can be used to predict the effects of such arbitrary sequences (see Friedman 


1978). 
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Muth 1961; Lucas and Prescott 1971; Merton 1971; and Lucas 
1972a). 


The Neglect of Learning 


At this point it is worthwhile to discuss a modification of the preceding 
kind of setup which several economists have apparently had in 
mind.*° For this purpose it is sufficient to consider the problem of 
maximizing the social welfare criterion subject to the given laws of 
motion (8), (4), (5), and (2) for W,, u,, D;, and «. By relabeling and 
reinterpreting the variables, we can think of this as a choice problem 
faced by a single private agent. In posing this problem, it was assumed 
that the agent solving the problem knows the true values of the 
parameters of the objective function (17) and the true values of the 
polynomials in the lag operator 8.(L), 5,(L), &p(L), and §,,(L). The 
observation has been made that this setup fails to incorporate a model 
of how the agent optimally learns about the 8’s from observations on 
past realizations of the forcing variables e, u, D, and W. Presumably, if 
the agent has only finite histories of observations on e, u, D, and W at 
his disposal, then at each point in time he is uncertain about the 
parameters of the polynomials 6. Why not modify the preceding setup 
to include uncertainty about the 8’s and a model of optimal learning 
about the 6’s? There seem to be three reasons why such extensions 
have not as yet successfully been incorporated into rational ex pecta- 
tions models. 

The first is as follows. A general model of optimal learning about 
the 6’s is readily available in the “Kalman filter,” which can be used to 
model how a rational agent would use observations on (€,, u,,D,, W,) to 
revise his prior beliefs about the 8’s.*! However, with the 6’s uncertain, 
it is no longer possible to give closed-form formulas for the optimal 
decision rule in terms of what are now the posterior probability 
distributions over the 8’s. The reason that no one has yet obtained or 
is likely ever to obtain such closed formulas is as follows. In deriving 
the closed form of the restrictions (21) for the case in which the 8’s are 


*° See Friedman 1979 and Modigliani 1977. 

5! See Anderson and Moore 1979. The Kalman filter provides a model of Bayesian 
learning about the 6’s where the initial prior and the posteriors are multivariate normal. 
However, as Hansen points out to me, normal posteriors for the 8’s are inadmissible for 
dynamic models of the class described here. This is because the dynamic optimization 
problems we consider may be ill posed for points in the parameter space of 8’s for which 
the zeroes of det 5(z) are less than V@ in modulus. Only priors and posteriors that 
assign zero probability to this region of the parameter space are in general admissible 
for our problems. This rules out multivariate normal distributions. Taking account of 
this admissibility constraint severely complicates the task of building a model of optimal 
learning about the 6’s. 
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assumed known with certainty, the Wiener-Kolmogorov prediction 
formula, 


-1 
EW = [Pe] doh 


was used extensively. Here [33_,. aj;L*], = joo a,L/, so that [ [2 
means “ignore negative powers of L.” The Wiener-Kolmogorov for- 
mula is equivalent with the “chain rule” of forecasting (see Shiller 
[1972] or Sargent [1979] for expositions). These equivalent forecast- 
ing rules are known to be correct for the case in which the 8’s are 
known with certainty. However, as Chow (1973) has pointed out, 
where there is a nontrivial posterior density over the 8’s, there is in 
general no known closed-form formula such as the above one for the 
2-step-ahead forecast. For example, it is not true that where 6 is 
uncertain, the correct expression for E,W,,,; is given by replacing the 
6,j;S with their posterior means in the above formula. The fact that 
there is no closed-form prediction formula for sufficiently general 
cases implies that it is impossible to derive closed-form versions of 
decision rules (and hence equilibria) that correspond to (21). As we 
shall see, for the kind of empirical work we are advocating, it is 
important to have a closed form for the mapping from the param- 
eters of the objective functions (17) and the dynamic constraints to the 
decision rule (20). From this viewpoint, the suggestion that one ought 
to build a learning mechanism into rational expectations models is not 
useful in suggesting practical econometric alternatives to the proce- 
dures recommended here.*? 

Another drawback with incorporating learning is that, even if one 
could derive the decision rules in the face of uncertain §’s, the issue 
would arise of how to determine the prior used to initiate the learning 
model for the 6’s. Would it be imposed a priori or estimated? If the 
initial prior were to be estimated, this would substantially complicate 
the estimation problem and add to the number of parameters. 

Finally, in many settings the Bayesian learning model implies that 
the posterior distributions collapse about the true 6’s as time passes 
without limit. In such settings, even if the researcher erroneously 
assumes that the 6’s are known with certainty when in reality agents 
are learning about them in an optimal way, the researcher continues 
to obtain consistent estimators of the underlying parameters {A , A,, 


32 Further, notice that if the decision rules could be calculated in closed form under 
uncertainty about the 6’s, the resulting time-series models would have time-varying 
coefficients and so be nonstationary. Even if calculating the decision rules were a 
tractable task under uncertainty about the 8’s, the loss of stationarity that it would imply 
might well be a price that the applied economist would not be prepared to pay even in 
exchange for the “greater realism” of the learning assumption. 
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Ags cst 0,fOn, 0am Ons using the methods described here and in 
Hansen (1979) and Hansen and Sargent (1980a). It does seem likely 
that by erroneously ignoring the phenomenon of learning about the 
§’s, the researcher is incorrectly calculating the asymptotic covariance 
matrix of his estimators. However, at present nothing is known about 
the nature of this error. Further, since we simply do not know how to 
compute optimum decision rules under the assumption that agents 
know the 8’s with uncertainty, no consistent estimators of the under- 
lying parameters have been proposed that incorporate agents’ learn- 
ing about the 6’s in the optimal way, to say nothing of expressions for 
the associated asymptotic covariance matrices. 

From the preceding considerations, I draw the conclusion that 
incorporating optimal Bayesian learning about the 6’s on the part of 
agents is not a research avenue that soon promises appreciable divi- 
dends for the economist interested in applying dynamic competitive 
models of the sort described here. 


A Model of the “Error Term” 


We now derive a “dynamic supply curve” for the industry by using the 
industry-wide production function Y, = fK; + €; to eliminate K from 
(20) in favor of Y. Multiplying both sides of (20) by f and then adding 
€;,, to both sides gives 


Yrs = Aof + fA w(L)W. + fA(L)D: 
+ fA (L)e, + fHy(L)ue 
lee lel hy eee eld eer 
Eliminating u, by using wu, = Py — Ao + AiY; — A2Dy gives 
Yess = [of — fHu(1)A0] + fAu(L) Pe + fAw(L)W, 


+ [fHp(L) ~ fy(L) Asbo 1D, as 
+ [H, + fA, (L)AIY: 
TD rr ie 
This can be written as 
Yip oat (Salo te oun La) Wa 
(24) 


+ Sp(L)Dy + Sy(L)¥e + SL ests, 


where 
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So = Hof — fAx(1)Ao, 

Sp(L) = fH, (L), 

S»(L) = fH,(L), 


(25) 
Sp(L) = [fAp(L) — fA (L) Asbo), 
Sy(L) = LH, + fA,(L)Aq], 
S(L)—(l4+ fA LL — Ary. 
Recall that the demand curve is 
Pe, Set Ae (3) 


Using 6.(L)e, = Vé and 6,(L)u, = V¥, we can write the supply and 
demand curves as 


Yuu = So + S,(L)P, as S..(L)W, aP Sp(L)D; 
+ Sy(L)Y¥, +S AL)6(L) Vivi, 
Pe gee gt Diet Onis) WV (27) 


(26) 


To discuss identification and estimation of the model, we need a 
theory about what is unknown to the econometrician. In constructing 
the model, we have taken the view that all of the variables on the 
right-hand side of the supply and demand curves (24) and (3) (or 
equivalently [26] and [27]) are known to the representative firm. 
Thus, from the viewpoint of private agents, (26) and (27) describe 
exact linear functions of the right side variables in which there are no 
“random errors.”**? The only tractable way that has so far been dis- 
covered of introducing random errors into (26) and (27) has been to 
assume that the econometrician has less information than do the 
private agents. The smaller information set of the econometrician 
leads to what from his point of view are random terms in relationships 
to be derived from (24) and (3) or (26) and (27). The idea is to restrict 
the econometrician’s information set relative to that of private agents 
in a way both that is plausible and that leads to a tractable statistical 
model of the error term. I shall describe two models of the error term 
that can be constructed in this way. 

One model results from assuming that the econometrician has time 
series on {P,, W,, D;, Y,} but never observes the random processes ¢; 
and u,. On this interpretation, e, and u, become random terms in (24) 


33 This is a consequence of the fact that the representative firm views itself as playing 
a dynamic “game against nature,” and so finds it optimal to use a nonrandom strategy, 
that is, a strategy that can be expressed as an exact function of its information variables 
and other state variables. 
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and (3) from the econometrician’s viewpoint.** In constructing the 
model, we have already imposed that V4, is orthogonal to all of the 
variables on the right side of (26) and will assume that V? is orthog- 
onal to Y;. We can also impose that V? is orthogonal to Dy, if we 
wish,?> although we might get by with a weaker assumption. 

The second model of the error term results from assuming that the 
econometrician sees less of D, and W, than do private agents. It is 
convenient to postpone a detailed discussion of this second model of 
the error and, instead, first to discuss identification and estimation 
under the first model of the error term. 


Identification and Estimation 


With this model of the error terms, we can proceed to discuss 
identification and estimation. First, notice that every variable that 
appears on the right side of the demand schedule (27) also appears on 
the right side of the supply schedule (26). The dynamic economic 
theory leading to (26) makes the reason for this clear, since any 
variables that help predict future prices P will appear in the supply 
schedule of the representative firm. This immediately implies that any 
variables that help predict the demand shifters D,, will appear in the 
supply schedule.** The fact that no variables on the right side of the 
demand curve (27) are excluded from the supply schedule (26) means 
that, if the supply schedule is to be identified, the source of identifica- 
tion must be found in restrictions of a kind different from the usual 
exclusion restrictions treated extensively in econometrics textbooks.*” 
According to the standard “order condition” for identification, equa- 


34 This is a version of the model of the error term analyzed by Hansen and Sargent 
(1980a) and Sargent (1978). 

35 We have assumed that V§ and V? are the “innovations” or one-step-ahead errors in 
predicting ¢, and u, on the basis of observations on all variables in agents’ information 
set at time ¢ — 1 (see n. 6). This implies that V¢,, is orthogonal to all variables on the 
right side of (26). If we assume that V¥ is orthogonal to Vé, it also implies that V+ is 
orthogonal to Y,. Imposing that V¢ is orthogonal to D,, amounts to assuming that D,, is 
strictly exogenous in (27), which is stronger than the Granger causality assumptions 
already imposed on D,,, namely, that except for lagged D’s, no other variables in the 
model Granger cause D,,. 

6 There is a singular class of exceptions to this statement. In the special case that 


ED 45 | AD psitecs = ED ip; | {Days $0 (28) 


for all; 2 1, D,,’s will appear in the demand schedule but not in the supply schedule. 
The condition (28) is usually thought to be exceedingly unlikely for any economic time 
series {D,,}. 

37 These remarks about identification should be compared with Milton Friedman’s 
discussion (1953) of the conditions needed for “supply” and “demand” to provide a 
useful categorization of the factors impinging on price and output. Friedman argued 
that the categorization was useful to the extent that it effectively sorted forces acting on 
price and output into mutually exclusive categories. 
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tion (26) is hopelessly underidentified.28 Thus if the parameters of the 
model are to be identified, sources of prior information not of the 
exclusion variety must be available. The main source of these restric- 
tions in the present model is the extensive body of cross-equation 
restrictions embodied in equations (21) and (25). Equations (21) and 
(25) give the parameters of the supply schedule (26) as nonlinear 
functions of the parameters {A, A,, Ao, f, d, B, n, Only), Onl), 64,(L), 
and 6.(L)}. In general, provided that the parameters rp and r,,, which 
determine the order of the autoregressive processes for D and W, and 
the parameters p and q, the number of elements in the vectors D and 
W, respectively, are large enough, these cross-equation restrictions 
identify or overidentify the parameters of the model. The strength of 
overidentification generally increases with increases in the orders rp 
and r, and the dimensions p and q.39 

At this point it is useful to collect together the equations comprising 
the model as 


Yui= So “5 S,(L)P; = S,(L)W, a Sp(L)D, 


(26) 
+ Sy(L)Y;4+ SAL)OAL AV E45 
Pp ay AY, F ASDi, 0, (Ll) Ve (27) 
OL) Wee ¥, (8) 
dp(L)D, = VP, (5) 
where 
So = +f?#An BAod™ — fH,(1)Apo, 
LA bea (AB) 64 Us 
Soh) = +finrpd-*| [ i — Wa ( )] | 
__—nhBf LAU = bu(AB)*8(L) 
Se(L) = APE 4, | Eo el on 


Sp(L) = +f?n Nad Aa p| —— 5 Sato) 


li=NeL 


DEON) Obl 
— fid-tn np] Vb, 


°° The fact that the demand curve excludes some variables that appear in the supply 
schedule is due to the static specification for the demand curve. This feature of the 
model would not survive a variety of alterations that might plausibly be used to 
introduce dynamics into the demand curve. E.g., if the demand schedule involved 
expected future prices as arguments, all variables that help to predict future prices 
would appear in the econometrically operational expression for current P, that would 
correspond to (27). 

%° This characteristic of identification in rational expectations models has been noted 
in various contexts by several authors, including Lucas (1975) and Sims (1980). 
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Equations (26), (27), (8), and (5) form a statistical model for the 
joint process (P;, Y;, W;, D,). The model is linear in the variables but is 
characterized by the extensive set of cross-equation restrictions de- 
scribed by (29). With the model of the error terms currently under 
discussion, the statistical model of the (P;, Y;, W,, D;) process has been 
spelled out sufficiently completely that we could write down the likeli- 
hood function for a sample (P,,Y,,;, D;),t — Ve... 7, assuming a 
normal probability density for (V?, Vi, V?, V;).%° Maximum likeli- 
hood estimates of the free parameters of the model {Ao, 4;, 42, f, 4 
5)(L), 5,(L), 5,(L), 5(L)} could then be obtained. Computational 
details of such procedures are described by Sargent (1977, 1978) and 
Hansen and Sargent (1980a). From the point of view of computing 
the estimates, it is a great practical advantage that (29) gives a set of 
closed-form formulas for the cross-equation restrictions imposed by 
the dynamic economic theory. 


Application of Bayesian Methods 


The fact that for the present model of the error terms it is possible to 
write down a normal likelihood function means that in principle 
Bayesian methods are applicable. Letting 6 be the list of parameters of 
the model and Z be the data, we have 


Tao ee 
post 7 ; 


or 
foot |Z} =1Z | O)fonok@/[1(Z | O)fonox(0)4 6, (30) 


where fposi{@ |Z} denotes the posterior probability density, f(Z) the 
probability density of Z, fyrio{} the prior density on 6, and 1{Z | 6} 
the likelihood function. Measures of the location and dispersion of 
the posterior distribution of 6 can be calculated, for example, by 
integrating 6* - fyos{0|Z} over 6 for appropriate values of k. In the 
Bayesian view, the role of data analysis is to trace out in as revealing a 
way as possible the mapping defined by (30) from the prior to the 


40 See Hansen and Sargent (1980a) for a discussion of the details. 
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posterior distribution. For such an analysis to be practical, it sub- 
stantially eases matters if the mapping (30) can be characterized 
analytically so that, for example, posterior moments such as 
J Foosel 0 | Z )d@ can be calculated without the need to resort to numeri- 
cal integration. Zellner (1971) and Leamer (1978) describe forms of 
prior densities fprior(@) that have the property that the mapping (30) is 
one that can be written as an analytic closed form when /(Z | @) is the 
normal likelihood function. 

In the context of dynamic economic models of the class represented 
by (26), (27), (8), and (5), the question of whether the mapping (30) 
can be characterized analytically hinges on which parameters one 
regards as being in the list @ about which the researcher has formu- 
lated prior information. One possibility is that 6 consists of the S’s of 
(26), the A’s of (27), and 5,,, 8,, and 8, of (27), (8), and (5). With this 
interpretation of 6, then since (26), (27), (8), and (5) are linear in the 
Ss, 4 'S76,.1nd'o, it iS possible to get analytic characterizations of the 
mapping from fprior(A) to fpost(O UZ). For example, Leamer (1972) and 
Shiller (1972) have shown how priors of various forms on the S’s in 
(26) can tractably be mapped into posteriors, in contexts where (26) is 
appropriately viewed as a regression equation. In effect, Leamer 
(1972) and Shiller (1973) provided formal Bayesian methods for 
imposing restrictions on lag distributions of a general kind, examples 
of which had long been imposed by applied econometricians. These 
restrictions usually corresponded to restrictions directly on our S's. 
Predating the work of Shiller and Leamer were the restrictions on 
distributed lags proposed by Koyck (1954), Cagan (1956), Milton 
Friedman (1957), Almon (1965), and Jorgenson (1966). There was 
also the frequently used identifying restriction that various distrib- 
uted lag weights sum to unity.*! All of these approaches view the S’s 
themselves as among the free parameters of the model about which 
the researcher can reasonably be imagined to have formed views 
summarized by a prior distribution. 

Unfortunately, the tractability of the Leamer-Shiller approach is 
purchased at the cost of ignoring the essential aspects of the dynamic 
economic theory leading to (26). According to that theory, the S’s are 
not free parameters but are complicated functions of the parameters 
HAGA fd, Ben, 0,12), Op), 0,40), 0.(L) tit is this list-of 
parameters about which it seems most appropriate to expect an 
economist to have prior beliefs. The parameters {Ao, A, Ay, f, d} are 
the parameters describing preferences and the technology, about 
which the economic theorist may have some prior beliefs. The 


* This restriction was criticized by Lucas (1972) and Sargent (1971) for essentially 
the same reasons given here. 
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economists’ prior beliefs about the parameters 18.9 Ops Onn Oaraabe 
presumably on a different theoretical footing than his beliefs about 
{A , Ay, Ao, f, d}, since the former list simply characterizes the serial 
correlation properties of the “shift variables” about which economic 
theory itself suggests little, although casual general observations may 
suggest a presumption in favor of high serial correlation, at least in 
some types of variables. In any event, it is the deep parameters {Ao, 
A,, Ao, f, d, Bn, Sy, Sp, Sy, 5} that must be estimated, if one is to build 
a model that potentially overcomes Lucas’s critique of econometric 
policy evaluation procedures.” 

When this list of deep parameters contains the objects of interest, 
Bayesian analysis using (30) becomes much less tractable. This is 
because the likelihood function /(Z | @) becomes a very complicated 
function of the free parameters in 6, by virtue of the complicated 
nature of the cross-equation restrictions illustrated in (29). Although 
Bayesian analysis is still possible, the researcher will be forced to use 
numerical methods to characterize the mapping from the prior to the 
posterior given in (30). For example, for a given prior, numerical 
integration will have to be used to calculate the moments of the 
posterior distribution. My own judgment is that given current com- 
puter technology, formal Bayesian estimation procedures seem pro- 
hibitively expensive for most members of the class of dynamic models 
considered here. This is obviously not an objection to Bayesian 
methods in principle. However, I believe that the high cost attached to 
applying Bayesian methods correctly helps to explain why they have 
not yet been applied extensively to estimating rational expectations 
models. 


A Second Model of the “Error Term”’ 


More serious limitations on the domain of Bayesian techniques 
emerge if the researcher embraces a second model of the error term, 
which we now discuss. In the second model of the error term, it is 
assumed that the econometrician possesses only observations on sub- 
sets W, C W, and D; C D, of the information variables that private 
agents use to forecast future w,’s and D,,’s.*° It is assumed that these 


“ A point related to that raised in n. 29 is relevant here. Priors and posteriors that 
assign positive probability to points in regions for which zeroes of det 8(z) are less than 
VB in modulus are inadmissible. This is because under such distributions, for some 
regions in the parameter space with positive probability, the dynamic optimum prob- 
lems are not well posed. Taking this into account would substantially complicate the 
analysis since it would involve using mathematically less tractable distributions. 

43 This model of the error term was originally proposed by Shiller (1972) in a related 
but somewhat different context. The model was applied in the present context by 
Hansen and Sargent (1980a). Nerlove, Grether, and Carvalho (1979) also recommend 
Shiller’s model of the error term. 
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subsets of information variables follow autoregressive processes 
5p(L)D, = V? and 5,,(L)W, = Vv, where §,(L) and 5,,(L) are polyno- 
mials in the lag operator of order 7p and ?,,, respectively. Then it turns 
out that the equilibrium law of motion for capital, (20), can be written 
in a form identical to (20), except that W, D, 5p, y, Hp, and H,,, are to 
be replaced by the corresponding objects with tildes above them, and 
that there appears an additional random disturbance 7», on the right 
side of (20). The cross-equation restrictions (21) continue to charac- 
terize the objects with tildes over them. The random variable 7, can 
be shown to be orthogonal to all of the current and lagged values of W 
and D.*° However, it turns out that 7, is in general serially correlated, 
with serial correlation properties that depend on the joint covariance 
properties of those variables in D, and W, that the econometrician 
does not have observations on. In the context of this setup, it is not 
even possible to write down the likelihood function without specifying 
details of the moments of information variables in D and W that are 
unobservable to the econometrician. It would seem attractive to adopt 
an estimation procedure that avoids the implicit theorizing about the 
stochastic properties of the unobserved D’s and W’s that an estimator 
using the likelihood function requires. 

One such estimation strategy that exploits the orthogonality of 7 to 
D and W, without requiring all of the added details required to write 
down a likelihood function, has been developed by Hansen (1979). 
The “generalized method of moments” estimators of Hansen have the 
advantage of delivering estimators of the free parameters whose 
desirable statistical properties do not depend on any arbitrary as- 
sumptions about the serial correlation properties of the 7,’s.46 These 
generalized method of moments estimators were invented precisely to 
handle situations in which the researcher is substantially more 
confident of the orthogonality conditions delivered by his theorizing 
than he is about the serial correlation properties of the error. These 
methods construct statistically consistent estimators, while avoiding 
the need to form the likelihood function. However, in acknowledging 
that he does not have enough information about the disturbances to 
construct the likelihood function, the researcher loses the ability to 
employ Bayesian methods, since knowledge of the likelihood function 
is essential for using Bayes’s law as in (30). 


44 See Hansen and Sargent 1980a. 

Ibid. f 

46 Under regularity conditions provided by Hansen (1979), the estimators of the 
underlying parameters are shown to be consistent and most efficient within a restricted 
class of estimators. Hansen’s discussion of the conditions for consistency, which also has 
implications for the conditions for consistency of maximum likelihood estimators, is at 
this date the key reference on issues of statistical consistency in linear rational expecta- 


tions models. 
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Interrelated Industries 


In the Lucas-Prescott model of a single industry, state variables which 
help the firm predict the future prices of inputs appear in the repre- 
sentative firm’s decision rule. The laws of motion of these input prices 
have been taken as given from outside the model. In actuality, the 
prices of these inputs are usually thought to be determined by trades 
in another market, one source of demand for which stems from the 
industry being modeled by Lucas and Prescott. If this other market is 
modeled explicitly, nontrivial modifications also occur in the analysis 
of the original industry. Thus, consider the example of a corn-hog 
model in which part of the output of one industry, corn, is an input 
into the production of the other industry, hogs. If the technology is 
such that hog producers have an incentive to forecast future corn 
prices, it follows that state variables that appear in the laws of motion 
for the total output and price of corn will also appear in the decision 
rule of the representative hog producer. Also, because the corn pro- 
ducer has an incentive to forecast the price of corn, which depends 
partly on the demand for corn from hog producers, the state variables 
that appear in the laws of motion for total hog output and the price of 
hogs will appear in the optimal decision rule of the representative 
corn producer. Hence, each industry inherits the state variables of the 
other. Furthermore, the equilibria in the two industries must be 
defined jointly, since the laws of motion for endogenous market-wide 
state variables are simultaneously determined for the two markets. 
There continues to be a fictitious social planning problem that a 
rational expectations equilibrium solves, one that is a version of an 
interrelated factor demand problem in which the planner is jointly 
optimizing the performance of both industries.47 

These remarks indicate that the analyst will often face a hard 
practical decision about which dynamics he takes as given from out- 
side the model. The internal logic of this class of models tends to 
propel the analyst toward a general equilibrium formulation in which 
the laws of motion characterizing each distinct industry are deter- 
mined simultaneously with the laws of motion for all industries with 
which it buys and sells. In any given application, the researcher will 
have to choose what laws of motion he takes as given from outside the 
model, for the purposes of the analysis at hand. 


Conclusions 
Remaking dynamic econometric practice so that it is consistent with 
the principle that agents’ constraints influence their behavior is a task 


47 These claims are proved for a particular version of a “corn-hog model” in some 
unpublished notes (Sargent 1980a). 
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that is far from finished. Further, properly allowing for the implica- 
tions of the principle will surely require abandoning many presently 
received ways of interpreting data. A variety of setups can be imag- 
ined that are consistent with the principle. For example, a variety of 
variations of the setup of this paper can be imagined in which agents 
optimize but have smaller information sets than have been attributed 
to them here. Also, information discrepancies across classes of agents 
can be assumed. In many such cases, endogenous variables such as 
prices will play an important role in conveying information to agents. 
In models with dynamics as complicated as those of our examples, 
these variations introduce substantial analytical difficulties. To date 
there is very little work which investigates the econometric implica- 
tions of such complications to setups like ours. With or without these 
complications, building a dynamic econometrics that is consistent with 
our simple principle from economic theory is a challenging task. It 
is sure to require substantial changes in the ways that applied 
economists interpret economic time series. 
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The purpose of this paper is to argue that the pursuit of an activist 
monetary policy may make economic sense even when people’s ex- 
pectations are formed rationally. The paper presents a simple model 
in which (1) prices are costly to adjust, (2) there is uncertainty 
concerning the parameters affecting aggregate demand, and (3) 
there are positive costs of gathering and processing information. By 
reference to this model it can be shown that an activist monetary 
policy may or may not be useful in offsetting aggregate disturbances, 
depending upon the extent of information costs and of parameter 
uncertainty. 


I. Introduction 


By an activist monetary policy I mean a policy whereby the monetary 
authority permits the money supply to react systematically to infor- 
mation concerning aggregate disturbances. The proposition com- 
monly known as the Sargent-Wallace (hereafter SW) proposition! 
asserts that an activist monetary policy cannot succeed in offsetting 
aggregate disturbances when people’s expectations are formed 
rationally unless the monetary authority possesses information un- 


An earlier version of this paper was presented at the meetings of the American 
Economic Association, New York, December 1977. I am indebted to Russell Boyer, 
Stanley Fischer, Joel Fried, Herschel Grossman, Robbie Jones, David Laidler, Robert E. 
Lucas, Jr., Michael Parkin, Arthur Robson, and an anonymous referee for helpful 
comments and criticisms, and to the Social Sciences and Humanities Research Council 
of Canada for financial support. , 

* See Sargent and Wallace 1975. The proposition is derived in a more general context 


by Barro (1976). 
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available to private agents. It asserts that in the absence of such supe- 
rior information systematic variations in the money supply will be an- 
ticipated and thus neutralized even in the short run. 

The SW proposition has been criticized by previous writers. Taylor 
(1975) pointed out that it will take private agents time before they 
learn the exact nature of the monetary policy.” During this transition 
period the SW proposition is invalid. Poole (1976), Fischer (1977), and 
Phelps and Taylor (1977) have pointed out that even if private agents 
possess the same information as the monetary authority they may not 
be able to act upon it immediately because of contractual precommit- 
ments, in which case the SW proposition is invalid. It has also been 
widely acknowledged that if the arguments underlying the SW prop- 
osition were valid it would be difficult to explain the observed persis- 
tence of movements in output and employment. 

The present paper focuses on another objection to the SW proposi- 
tion: namely, that it fails to take into account (1) the costs of gathering 
and processing information and (2) uncertainty concerning the 
structure of the economic system. People may choose not to collect 
and process the monetary authority’s information concerning aggre- 
gate disturbances when it is costly to do so.* Indeed, everyday obser- 
vation suggests that most people pay no attention to such information. 
In such circumstances people may still be forming their expectations 
rationally, in the sense that they are making efficient use of (costly) 
information, yet they are not in a position to anticipate and neutralize 
the effects of an activist monetary policy. In order for the SW propo- 
sition to be true the monetary authority’s information must not only 
be available to private agents it must also be economically usable by 
private agents. 

This qualification to the SW proposition has been noted by Barro 
(1976), who argued nevertheless that the monetary authority could do 
Just as well by disseminating its information as it could by varying the 
money supply. However, this paper attempts to show that such a 
disseminating policy may be inferior to an activist policy when there is 
uncertainty concerning the values of parameters in the economic 
system because (1) people may not have enough incentive to process 
information, and (2) even if people do process the information there 
is no guarantee that this will produce the same outcome as would a 
properly designed monetary policy. A demonstration of this argu- 


* A similar point was made by Benjamin Friedman (1979), who showed the similarity 
of rational expectations to error learning during the learning process. 

* See, however, Lucas 1975 and Sargent 1976. 

* This point has been stressed by Rutledge (1974), Frenkel (1975), Feige and Pearce 
(1976), Laidler (1978), Shiller (1978), and Friedman (1979). 
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ment can be made precise only in terms of a formal model, but it can 
be sketched in the following terms. 

First, even if the information concerning aggregate disturbances is 
provided to people they must still incur the cost of processing it if they 
are to use it. But they may not find it economical to incur that cost if 
they think that not many other people will be using the information, 
even when its social value exceeds the processing cost. Consider, for 
example, an agent who is deciding what price to charge for the single 
good that he sells. Suppose he is told that the demand for money has 
doubled. If he thinks that all other agents are using this information 
he can calculate that his best course of action will be to cut his price in 
half, along with everyone else. The information may be quite valuable 
to him and to society because failure to conform in reducing his price 
would leave him charging a suboptimal relative price. But if he thinks 
that no other agent is using the information his best course of action 
will depend on the effects of that change in the demand for money 
on the demand for his product. If this effect is difficult to estimate 
accurately then the information may have little value to him. A con- 
vention like universal indexing that leads people to believe that 
everyone else will be using the information as well may be necessary 
for the private value of such information to be as great as its social 
value. Everyday observation again suggests that such a convention 
rarely exists. 

Second, in a world without complete Arrow-Debreu contingent- 
claim markets there is no guarantee that even an otherwise frictionless 
competitive equilibrium will be efficient with respect to the informa- 
tion possessed by private agents. It is possible that the monetary 
authority is in a position to make better use of aggregate information 
than are private agents, even if the above-mentioned indexing con- 
vention could be established. In the example of the previous para- 
graph, Brainard’s (1967) analysis suggests that if the values of the 
parameters of the economic system are not known with certainty then 
the optimal reaction of the monetary authority is to allow the money 
supply to increase, but by less than 100 percent. But if the informa- 
tion is disseminated prices may indeed fall in half, which would be 
equivalent to a 100 percent increase in the money supply. As in the 
theory of the second best, full information results in the best equilib- 
rium, but when information is incomplete then more information 
does not necessarily result in a better equilibrium.® 


> This result is similar to those of Hirshleifer (1971) and Grossman and Stiglitz 
(1976), who argued that the equilibrium allocations with more information may be 
Pareto inferior to those with less. 
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The remainder of the paper attempts to make these arguments 
more precise by developing a simple model of a monetary economy in 
which prices are costly to adjust. Section II describes the basic features 
of the model; Section III analyzes the private decision whether or not 
to use information; Section IV characterizes the optimal policy of the 
monetary authority showing how an activist policy may be justified by 
the first of the above arguments; Section V considers a variant of the 
model in which the second of the above arguments may be used to 
justify an activist policy; and Section VI contains some concluding 
remarks. 


Il. The Basic Model 


Preliminaries 


This section describes a simple hypothetical economy possessing the 
features described above. In this economy there are two tradable 
objects: money and a good. There are also two classes of economic 
actors: a monetary authority, whose job it is to determine how much 
money will exist and to distribute the money, and a set of private 
agents (referred to simply as “agents”), who produce and consume the 
good. In the interest of simplicity we suppose that each agent pos- 
sesses identical tastes, technology, and initial endowments. (Thus we 
are making no distinction between firms and households.) In order to 
establish a basis for exchange among these identical agents we assume 
that each of them has an absolute aversion to consuming units of the 
good that he himself has produced. Thus he is induced to sell all of 
these units and to purchase for consumption units of the good that 
are produced by others. 

To keep the demand and supply functions simple we suppose that 
each agent evaluates his demands according to a Stone-Geary utility 
function and produces subject to a quadratic cost function. That is, 
each agent’s utility during a period depends upon g, the amount of the 
good consumed during the period, m, the real value of money bal- 
ances held at the end of the period, and q, the amount of the good 
produced during the period, according to the particular function 


U = K@*(m + x)-* — (k/2)q? (1) 


where k is a positive, nonstochastic parameter equal to the slope of the 
agent’s marginal cost schedule, a and x are independent stochastic 
parameters with 0 <a < 1, and K isa stochastic parameter defined as 
K =a““(1 — a)“. For reasons that will become apparent later we 
shall refer to the parameter x as “the level of aggregate demand.” 
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In order for purely monetary changes potentially to have real 
effects even in the short run we must not assume that there exists a 
central Walrasian auctioneer who is able each period to compute a 
single market-clearing price at which all trades are to occur. The 
actual transaction prices must be determined before the agents have 
collected enough information for them to be able to compute this 
Walrasian equilibrium price. We accomplish this by assuming that 
each agent is responsible for setting his own selling price. He must set 
the price before learning exactly how much he will be able to sell, and 
all trades must occur at these posted prices whether or not markets 
are clearing in the usual sense. Assume also that the quantities trans- 
acted at these prices always equal the quantities demanded. In other 
words, when markets are not clearing the sellers bear all the resulting 
quantity rationing.” 

At the time of the price-setting decision each agent’s expected 
utility will depend negatively on the variance of his error in forecast- 
ing demand, because he must produce subject to increasing marginal 
cost. Thus he will have an incentive before setting his price to gather 
and process information concerning the demand function facing him 
so as to reduce the variance of this forecast error. The decision on 
what information to use at this stage will be the main focus of the 
following analysis. 

When determining the value of M, the nominal per capita stock of 
money, the monetary authority is assumed to be unaware of the exact 
state of aggregate demand. Suppose, however, that he is able to 
acquire partial information by observing a variable, s, that is corre- 
lated with aggregate demand. In particular, assume that 


x = CS (2) 


* This setup may be contrasted with Lucas’s (1972), in which the agents are randomly 
divided into subgroups, each with a miniature Walrasian auctioneer, and with no 
communication between groups during the market period. Both setups require agents 
to make trading commitments before being completely informed of other agents’ 
dispositions to trade. Lucas’s approach has the advantage that market-clearing models 
have been more thoroughly analyzed in the literature than price-setting models. Ours 
has the advantage of replacing the Walrasian auctioneer with a scheme that bears some 
resemblance to the way prices actually get determined in many real-world markets. 
Whether our results could be transposed to a suitably constructed “market-clearing” 
model is an open question. 

‘This rule also appears to be compatible with the way transaction quantities are 
determined in many real-world markets, especially those organized by specialist 
traders—wholesalers, retailers, etc.—who post prices and hold inventories which they 
allow to act as buffer stocks against fluctuations in demand. By absorbing the nonprice 
rationing they provide a service to customers, for which they are rewarded by a spread 
between their buying and selling prices. In effect we are assuming that each agent is a 
combination consumer-producer-trader. (However, we are ignoring the potentially 
important role of inventories by assuming that the good is not storable.) 
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where c is a stochastic parameter. We shall refer to s as the “indicator” 
of monetary policy. The monetary authority is assumed to obey the 
following rule:® 


Mos sMi 1a 25) (3) 


where M (> 0) is the average per capita stock of money and g is the 

“policy reaction coefficient.” Thus the monetary authority follows a 
“feedback” rule of reacting to observable information concerning the 
level of aggregate demand, and its two decision variables are M and g. 
The SW proposition asserts that the stochastic behavior of aggregate 
output in the economy will be independent of the monetary author- 
ity’s choice of g. 


The Sequence of Activities 


During each period activities proceed according to the following 
sequence. First, there is a stage in which the monetary authority 
decides on his policy. That is, he chooses values for g and M. When 
making this decision he is assumed to possess an accurate model of the 
economy, including exact knowledge of all deterministic variables, 
parameters, and functions, and the probability distributions of all 
stochastic variables. He does not, however, know the exact value that 
will be assumed during the period by any stochastic variable. As soon 
as he makes his decision all of his information is conveyed to each 
private agent, including the chosen values of g and M. Thus there is 
no scope for the monetary authority to fool private agents, even in the 
short run, about the nature of his policy, or to make this policy 
decision using information that is not available to private agents. 

Second, there is a stage in which private agents make their 
monitoring decision. That is, each one must choose whether to pay 
the cost of learning the exact value of s before setting his price, in 
which case he will be able to reduce the variance of his forecast error, 
or to wait until after his price has been set, when the value of s will be 
revealed to him at no cost. In the interest of simplicity we assume that 
the indicator is the only stochastic variable whose value is potentially 
observable before the price-setting decision is made. 

Third, there is a stage in which the exact value of the indicator s is 
determined. Any agent who has decided to monitor observes s and 
pays the cost at this stage, as does the monetary authority (unless he 
has set g = 0). Note that once again we are not giving the monetary 
authority any informational advantage. At this stage, before he 


* A purely random, unpredictable component could be added to (3) but, as Barro 
(1976) has also shown, the optimal policy, if feasible, would be to set its variance to zero. 


ACTIVIST MONETARY POLICY 255 


chooses his price anyone can choose to be as well informed as the 
monetary authority. 

The fourth stage is the one in which the values of all prices and of 
the money supply are decided. The money supply is determined 
automatically by equation (3), and the private agents choose their 
prices in such a way as to maximize expected utility. If they have 
decided to monitor the indicator (to learn the value of s), then private 
agents can forecast the exact value of M by using equation (3) and can 
use this knowledge in making their price decision. Otherwise they 
must make their decisions on the basis of expectations concerning M, 
which they form using (3) and their knowledge of the probability 
distribution of s. When setting his own price no agent is able to 
observe the others’ prices. However, he is able to tell whether or not 
the others are monitoring the indicator, and this will be enough in 
most cases to allow him to predict exactly what the others’ prices will 
be. 

The fifth stage is the one in which all uncertainty is resolved. In this 
Stage the monetary authority distributes money to the private agents 
in equal amounts, and the exact values of a,x,K,c, ands are revealed 
to everyone. Whatever values of the utility parameters a,x, and K are 
determined in this stage, they are shared by all agents in common. 
Thus, with this information each agent is able (i) to calculate exactly 
how much his sales will be for the period and (ii) to compute his own 
demands for money and the good. 

In the sixth and final stage the agents visit one another to execute 
their planned purchases, and each agent must produce to meet the 
demands placed upon him. Up until this stage each of the identical 
agents will have been led to make identical decisions. Thus every 
agent will be charging the same price. Assume that in such a situation 
the quantity demanded from each agent will be identical.° 

To understand what happens in this sequence we must start at the 
end and work back. Consider an agent who, during the sixth stage, is 
going to be called upon to produce the quantity g. Suppose that all 
other agents have set the same price, P, and that he himself has set the 
price rP. Then during the fifth stage he will be aware of the value of q 
and of the exact values of all the parameters in his utility function (eq. 
1). He will formulate his own demands, g and m, so as to maximize this 
utility function subject to the budget constraint 


m+g=m+t+rq (4) 


® Alternatively, we could suppose that they all face a stochastic demand function with 
independent, identical probability distributions, the randomness representing the in- 
determinacy in the distribution of sales among identical sellers charging identical 
prices. However, the addition of one more random factor would add nothing but 
notational complexity to the analysis. 
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where m = M/P denotes his initial holding of real balances.’ Thus his 
demand for the good will be 


Gg =a(m+r7rq +x), (5) 


and by substituting from (4) and (5) into (1) we can see that the level of 
utility that he will obtain is determined by the indirect utility function 


Ving: 4.1) = Wit 1g ni (ki 2g (6) 


During the fifth stage each agent will also be able to predict his 
sales, g, in the following way. First, suppose that every agent is charg- 
ing the same price, P. Then every agent will sell the same quantity, q, 
and will formulate demands according to (5) with r = 1. But every 
agent will also realize that his sales, g, will be equal to the per capita 
demand, g. Thus the values of q and g can be computed by substitut- 
ing q = @ into (5), with r = 1, to produce 

q = b(m + x) (7) 
with 
a 


i > 0. (8) 
l-a 





Equation (7) indicates the amount that each agent would demand in 
stage 5, as well as the amount that he would expect to sell, under the 
assumption that r = 1. And indeed the equilibrium will be one in 
which r = 1. But to see how prices are chosen we must indicate how 
much each agent would expect to sell ifr 4 1. Let us suppose that this 
quantity is given by the expression 


q(m, x, b, r) = max [b(m + x) + n(1 — 1), 0] (9) 


where n > 0 is a measure of the “conjectured competitiveness” in the 
economy. In other words, it measures the extent to which each agent 
supposes that he would lose sales by raising his price above the market 
price. The limiting case in which n = +~ is the one usually associated 
with the assumption of perfect competition. 


* It has been argued that even in markets where demanders choose quantities at 
predetermined prices the demand decisions must take into account the future relation- 
ships between trading partners, as well as the usual objectives, in which case the 
quantities chosen may be on neither the demand nor the supply curve as usually 
conceived (Barro 1977). This consideration is probably important in markets where 
exchange is conducted on a highly personal basis because of the costs of switching 
trading partners, as in labor markets, markets for personal credit, and so forth. 
However, there are also many markets where exchange is quite impersonal and buyers 
may, because they can remain anonymous to the sellers, choose their quantities without 
taking this consideration into account. When we assume that demanders choose their 
quantities according to the usual sort of utility maximization we are implicitly assuming 
this sort of impersonal arrangement. 
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We now have introduced a total of seven random variables: b,c,s,x, 
a,K, and M. Let us suppose that the probability distributions of the 
first three of these are mutually independent and that 


E(6)=B>0;E(¢)=y>0;E() = 0 
var (6) = of > 0; var (c) = o2 > 0; var ($s)=o?>0. 


(10) 


At the outset of the fourth stage each agent realizes that if he sets 
the price rP his utility will equal 


S| 


M k\ (M 2 
(lie ps) oF cs rg p il ~ 3), 68, b,7| = (x)a lp 2 — 6), 08, 5,7]. 
(11) 


(This is obtained by substituting from [2], [3], and [9] into [6].) He will 
know that the stochastic variables b, c, and s obey (10), and he may or 
may not know the exact value of s, depending on whether or not he 
has chosen to monitors in the second stage. Suppose that in stage 2 he 
has made the same monitoring decision as every other agent. Then he 
will be able to calculate P, the price set by all other agents. He will also 
know the values of M, g, and k. Thus he will choose a relative price so 
as to maximize the expected value of (11) conditional on the informa- 
tion set /, where J = {s} if he has monitored and/ = @ otherwise. The 
first-order condition of this maximization (assume that g[m,x,b,r] >0 
w.pr.1) can be expressed as 


E{b[M(1 — gs\/P + cs] + n(1 — 1) |I} = rq, (12) 
where 

Gn = ni(1 + nk). (13) 
Let the superscript7 denote the information set,i = 1 denoting / = {s} 
andz = 2 denoting J = @. Then the solution to (12) can be written as 

the relative-price function 
N= (PIM Fess) r= ily B (14) 
In order to tell what price to set each agent must also calculate P, the 
price that the other agents each will set. To do this he calculates the 
value of P that makes the value of r in (14) equal to unity. That is, he 
expects the price to be one such that every agent (including himself) is 


induced to conform. This will also be the actual price set by all agents 
in stage 4. It can be expressed as 


P = Pi(M, g,5),7 = 1, 2, (15) 
and, from (12), it must satisfy the condition 


E(q|1) = E{b[M(1 — gs)/P + cs]|I} = Gy. (16) 
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In the limiting case of perfect competition, (16) states that the market 
price will be set so that the expected market demand (per capita) 
equals the competitive market supply (per capita), g = lim q, = 1/k. In 
other words, there is expected market clearing." "~~ 

Let m, = Gn/B. From (10), (15), and (16) the prices can be expressed 
as 


P1(M, g,s) = M(1 — gs)/(m, — ys), (17) 
P2(M, g,s) = Mim. (18) 


Thus the quantity of output that will be produced per person in stage 
6 can be expressed as a function of the information set, the reaction 
coefficient g, and the random variables b,c, ands by substituting from 
(2), (3), (17), and (18) in (7) to obtain 


qi = b[m, + € — ys], (19) 
q2, = b[m,(1 — gs) + cs]. (20) 


The essence of the SW proposition is contained in (19). If agents 
monitor s then the behavior of the quantity q is independent of the 
reaction coefficient g. On the other hand, (20) shows that q is affected 
by g if no monitoring occurs. 

From (10), (19), and (20) the mean and variance of output per 
person, as seen at the beginning of stage 3 (i.e., before the exact value 
of s has been ascertained), can be expressed as 


E (q}) = E(qi|s) = E(q2) = Gn 4 E(q@2|5); (21) 
var gi = o}m2 + 02(B? + o%)02, (22) 
var g2 = of + 02(B? + o3)lo2 + (y — gm,)*). (23) 


From (6) and (17)—(21) the utility that each agent expects at the end of 
stage 2 may be expressed as 


Vi(g) = EV(MIPi, gi,,x, 1) =m, + Gn — (&/2)g2 — (k/2) var gi; 7 = 1, 2. 
(24) 


It follows from (22)—(24) that the gain to society from everyone 
monitoring the indicator (i.e., Vifg] — V2[g]) arises solely from the 
consequent reduction in the variance of output. This reduction in 
variance yields a welfare gain because the marginal utility of more 
production is (by assumption) a constant but the marginal cost is 
increasing, so that total utility is a concave function of the level of 
output. 


11 An almost identical condition was assumed to hold by Fischer (1977, pp. 195-97) 
and by Phelps and Taylor (1977, pp. 166-70). 
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III. The Monitoring Decision 


Equations (17)—(24) show how each agent’s price, output, and ex- 
pected utility will depend on the monitoring decision made in stage 2 
and the reaction coefficient chosen in stage 1. This section analyzes 
the monitoring decision. 

Let zm (> 0) denote the cost, in utils, of monitoring the indicator. 
This cost will generally equal z, + z, where z, is the cost of collecting 
the information (i.e., of determining the exact value of s) and z, is the 
cost of processing the information (i.e., of computing and posting the 
optimal price given the value of s). Each agent will choose to monitor 
if the gain to him in expected utility is at least as great as zp. 

The size of this gain will depend on whether or not the other agents 
are monitoring. Consider first the case in which none of the other 
agents is monitoring. In this case the agent will derive the expected 
utility V3(g), as determined by (24), if he too chooses not to monitor. 
But if he chooses to monitor, the market price will continue to be P2, 
as in (18), and he can now profit from knowing the exact value of the 
indicator before setting his own price. Because he can infer, as before, 
the exact value of P before determining his relative price, that relative 
price will be the one that satisfies the first-order condition (12) with J 
= {s} and P = P2(M, g;s). That is, 


ri(P2,M,g,s) = 1+ sBly — gm,)/(q, +n). (25) 


Let V3(g) denote the expected utility of the agent who monitors when 
no one else is monitoring. This can be obtained by substituting from 
(25) into (6) and using g = q3 = q? + n(1 — 7) with q? given by (20). Asa 
result of this calculation it can be seen that the gain in expected utility 
from deciding to monitor when others are not monitoring is 


Vi(g) — Vilg) = d(n)o5B?(y — gm)? (26) 
where ¢(n) = (1 + nk)?/2n(2 + nk). Note that 
b(n) > (k/2) and d(n) > (k/2) asn > ~, (27) 


The nature of this gain is easy to see. The expected output of the 
agent is not affected by the decision to monitor but its variance is, and, 
as we have argued, a reduction in variance increases expected utility. 
Specifically, whenever the level of aggregate demand is high (s > 0) 
the monitoring agent will, according to (25), raise his price, thereby 
damping the increase in his sales (assuming that g < ym;'). Likewise, 
when s < 0 the monitoring agent will bolster his sales by reducing r. 
Not only does this reduce variability it also adds to the agent’s ex- 


12 This result is also derived in the Appendix. 


260 JOURNAL OF POLITICAL ECONOMY 


pected revenue. In the limiting case of perfect competition, (25) 
implies that these changes in r will be imperceptible. In this case the 
agent is able to adjust the mean position of his random sales schedule 
whenever a nonzero value of s is observed so that the expected value 
of his sales conditional upon s is equal to q,"* and the entire gain to the 
agent from deciding to monitor arises from the resulting reduction in 
the variance of sales. In particular, (26) and (27) imply that the gain is 
equal to (k/2) times this reduction in variance.” 

Next, consider the case in which all of the other firms are monitor- 
ing. In this case the agent will derive the expected utility V}(g) as 
determined by (24) if he too chooses to monitor. But if he chooses not 
to monitor then the price will continue to be P} as in (17), and the 
agent will lose by not being able to foresee, before setting his own 
price, the variations in s and in P} that his competitors are taking into 
account. The nonmonitoring agent will choose a price P so as to 
maximize EV[M/P}, qi + n(1 — P/P}), x, P/P1]. Let V4(g) denote the 
expected utility of the agent who does not monitor when others are 
monitoring. Then, as the Appendix demonstrates, the gain in ex- 
pected utility from deciding to monitor when others are monitoring is 


var €y 


VAG Mia) aD a gs 


(28) 


where e, = P1(M; g, 5) [EP1(M; g, Ss) *]"' is the reciprocal of the 
market price, expressed as a ratio to its mean value, and where w(n) = 
n + n*k/2. Note that 


U(n) > © asn > o, (29) 


If everyone is to monitor then the gain from deciding to monitor 
when everyone else is monitoring must be at least as great as the 
monitoring cost; that is, 


Vnlg) — Vile) 22m: (30) 
Likewise, if no one is to monitor, it is necessary that 
Ve) Ve (oe (31) 


From (26) and (28) it follows that for large enough n at least one of 
these conditions must be satisfied. But it also follows that both of them 


* The limiting case of perfect competition is denoted by the absence of the subscript 
n on the variables 4, my, Vi, 8, etc. 

“ Notice that we are not allowing a monitoring agent to gain by selling his informa- 
tion. Allowing a market for information would complicate the analysis without affecting 
our main result. For in the case that we focus on below where an activist policy 
dominates all others the private value of information is less than the cost of processing 
it. In this case no one would willingly pay the monitoring agent anything for his 
information. 
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may be satisfied. In other words, convention appears to play an 
important role in the monitoring decision. It may be that everyone 
will monitor if they all expect everyone else to, but no one will monitor 
if they all expect no one else to.!° I shall assume that monitoring will 
occur if and only if (31) is violated—in other words, that the conven- 
tion of monitoring will develop only when the convention of non- 
monitoring is unsustainable.!® Notice, however, that when the reac- 
tion coefficient assumes the value 


gn = ylm, (32) 


then the convention of monitoring could never be sustained as long as 
Zm > 0, because from (17), (26), and (28), 


Viul&n) — Vnln) = Vien) — VilGn) = 9. (33) 


In order to simplify the analysis further, let us assume the limiting 
case of perfect competition. Then, from (2), (26), (27), (31), and (32) 
the necessary and sufficient condition for monitoring to occur is 


(K/2)A.o7B°(1 — gig) > Xm (34) 


where \, = y”/(y? + a2) is a measure of the accuracy of the indicator, 
being the square of the coefficient of correlation between x and the 
conditional forecast ys of x based on the indicator, o?. = var x is the 
variance in aggregate demand, and ¢ = yk, the limiting value of (32). 

According to (34), the likelihood of monitoring occurring is larger 
(i) the larger is the slope, k, of the typical producer’s marginal cost 
schedule, because the larger this slope the more the firm loses from 
variability in its demand that could be avoided by monitoring; (ii) the 


15 In such cases the private value of the information contained in s is larger when 
everyone else has the information than when no one else has it. This result may appear 
somewhat counterintuitive, but it arises from the following considerations. In stage 4 
the agent who knows s may gain because this allows him to vary his price in anticipation 
of variations in (a) the variable x which is correlated with s, and (b) his competitors’ 
price, which may be predicted once s is known. When n is large, then b will be the 
dominant factor if others are observing s, because variations in his competitors’ absolute 
selling price imply variations in his own relative selling price, and thus large variations 
in his sales, if he is unable to anticipate them. But if no one else is observing s then 6 no 
longer applies because he does not need to observe s in order to anticipate his competi- 
tors’ price. More generally, the result arises because the main use of the information is 
to allow the agent to coordinate his activities (specifically the activity of setting his 
nominal selling price) with the activities of other agents. As an analogy, the value of 
knowing that you should drive on the left in England would be much less if none of the 
other drivers knew it. This analogy also illustrates why multiple equilibria that must be 
selected by convention can arise under such circumstances. Pm 

16 Thus we may say that an equilibrium with respect to any (g, M, ) consists of a 
monitoring decision i € {1, 2} and a price function P;,(M, g, 5) such that (a) for anys the 
optimal relative price r;,(P, M, g, 5) equals unity when P = Pi(M, GS) and (b) the 
monitoring decision isi = 1 only if Vi(g) — VA(g) 2 2m, and i = 2 if and only if Vi(g) — 
VA eae 
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more accurate is the indicator, because an inaccurate indicator con- 
veys little useful information; (iii) the more variable is the level of 
aggregate demand, because there is little gain in monitoring a variable 
that is relatively constant; (iv) the larger is B, because there is little gain 
to monitoring a variable if its expected effect on demand is small; and 
(v) the further is the reaction coefficient from g, because when g = ¢ 
then, as can be inferred from comparing (22) with (23), the monetary 
authority is already accomplishing everything that private agents 
could accomplish by monitoring. 


IV. Optimal Monetary Policy and the Incentive to Monitor 


Three implications of this analysis are worth noting here. First, the 
SW proposition is valid for all values of g satisfying (34). For in this 
case monitoring will occur, and, as noted before, the behavior of 
output is unaffected by the value of g. According to (22) and (24) each 
agent’s expected utility will also be unaffected by the value of g. 

Second, whatever can be accomplished by monitoring can be ac- 
complished just as well by having the monetary authority set g = g. 
According to (34) no monitoring would occur, but according to (19), 
(20), and (24) the behavior of output and expected utility would be 
the same as if monitoring was occurring. 

Third, as was argued in the introduction above, private agents may 
not have much incentive to monitor s even when the social gains from 
everyone deciding to monitor are large. That is, the private value of 
monitoring, V3(¢) — V?(g), may be small when the social value, V1(g¢) — 
V?(g), is large. In particular, it follows from (22)—(24) and (26) that 


AE UE ASAE (35) 


where A, = 8?/(B? + a3) is a measure of the predictability of the effect 
upon effective demand of changes in x, being the square of the 
coefficient of correlation between the actual effect bx and the pre- 
dicted effect Bx. If the value of the coefficient 6 is known with cer- 
tainty, then the private and social values coincide. In the limiting case 
of pure uncertainty (when [6/c,]? — 0) the private value is infinitesi- 
mal in comparison with the social value. 

The reason for this divergence of private from social value is seen 
most easily in the case of a perfectly reliable indicator («2 = 0) and a 
neutral monetary policy (g = 0). If everyone monitors then the price 
level will adjust to fluctuations in x so that the sum m + x is constant 
and equal to m, with output equal to q! = bm. Thus everyone 
monitoring together will completely eliminate the effects of x upon 
output. An isolated agent monitoring when no one else is cannot stop 
m +x =m +x from fluctuating. He can, and will, through impercep- 
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tible changes in r alter the mean position of his demand curve so as to 
offset the predicted effect Bx, but there will still remain an unpre- 
dicted effect (6 — B)x. Specifically, by substituting (25) into (9) we see 
that his output will be g? = bm + (6 — B)x. Thus the social gain to 
monitoring comes from eliminating the total effect bx of changes in 
aggregate demand, whereas the private gain comes from eliminating 
only the predicted effect Bx. The ratio A, of private to social gain is 
just the ratio of variances of the predicted and total effects. 

Consider three different monetary policies. (i) Policy N is the neu- 
tral policy, g = 0, which is assumed to result in no administrative cost. 
(11) Policy D is the disseminating policy suggested by Barro of setting g 
= 0 but also publicizing the value of s, which is assumed to result in 
the administrative cost zq per person. (iii) Policy A is the activist policy 
of setting g = g, resulting in the administrative cost z,. Assume that 
the cost z, is incurred whenever any nonzero value of g is chosen. 
Then policy A will dominate any other policy with g # 0, and we 
may restrict our attention to policies A, D, and N. Which policy is 
optimal will depend upon the relationships between the costs and 
benefits of monitoring and the administrative costs of the different 
policies. Let 


Vs = V0) — V2(0) = (k/2)d, var (bx) (36) 


denote the social gain to monitoring and V? = V3(0) — V2(0) = A,V8 
denote the private gain to monitoring, when g = 0. Then according to 
(34) monitoring will never occur under policy A and will not occur 
under policy N if \,V* =z, + zp. Policy D is assumed to save the private 
agents any collecting costs, but if they are to use their knowledge of 
the value of s they must still incur the processing cost. Thus monitor- 
ing will not occur under policy D if \,V* Sz,. Table 1 depicts the cost 
per person of the different policies in the three situations indicated, 
where the cost is measured as V1(0) minus the expected utility result- 
ing from the policy plus the monitoring and administrative costs. In 
each situation the optimal policy is the one with the least cost. 


TABLE 1 


CosT PER PERSON OF MONETARY POLICY 














PoLicy 
SITUATION N D =.- A 
Leet pe Ze heey iter p an 
WZ iN Vas ie peta Vs alae (dG 


V2 Vie Za oe 


Note.—N = neutral policy, D = disseminating policy, A = activist policy. 
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Note that if the indicator is unreliable enough (i.e., A, small enough) 
or if effective demand is stable enough (var [bx] small enough), cetercs 
paribus, then a neutral policy will be optimal because both of these 
factors make V* small. Both of these factors also figure prominently in 
Milton Friedman’s (1953, 1968) case against activist policy. For the 
same reason, a small slope to marginal cost (small k) favors a neutral 
policy. Note also that in situation III the disseminating policy is 
dominated by the neutral policy. 

For our purposes the most important implication to note about 
table 1 is that an activist policy will be uniquely optimal whenever z, < 
Vs S z,/dy; that is, if (i) the indicator is reliable enough, effective 
demand is unstable enough, or marginal cost rises fast enough that V* 
exceeds the cost of administering an activist policy; and (ii) the cost of 
processing the information contained in the indicator is large enough 
or the predictability of the effect of changes in the level of aggregate 
demand is small enough that V* S z,/A,. These conditions do not 
require the monetary authority to possess any cost advantage. If A, is 
very small, they may hold even if the cost of administering an activist 
policy is much larger than the private cost of monitoring. If the 
conditions hold then an activist policy is optimal because despite the 
large social gain to monitoring no one will do it even if the cost of 
collecting information is saved. For even under a disseminating policy 
the user of the information contained in the indicator must incur a 
processing cost, and, as we have seen, he will be led to underestimate 
the gains from using the information. 


V. A Variant of the Basic Model: Monetary Policy and the 
Efficiency of Monitoring’’ 


One special feature of the basic model employed above is that the 
terms m and x enter into the aggregate demand function (5) with the 
same coefficient. This is why the optimal reaction coefficient ¢ always 
makes the expected level of output (conditional on s) equal to its 
optimal value, g (see [20]). When m and x have different random 
coefficients then Brainard’s (1967) analysis suggests that the optimal 
value of the reaction coefficient will generally be smaller than this. 
Such a case is easily constructed by replacing the utility function (1) 
with the variant 


U = K(q — x)%m'-4 — (k/2)q? + 2x, (37) 


in which case the indirect utility function is still given by (6), but the 


‘7 The rationalization for activist policy presented in this section is similar to the one 
presented by Fane (1977). 


ACTIVIST MONETARY POLICY 265 
effective demand function changes from (7) to 
q=bm+x. (38) 
As in (19) and (20), the value of output with or without monitoring 
can be expressed as 
Gn = b(m, — ys/B) + cs (39) 
dz = bm,(1 — gs) + cs. (40) 
Once again the expected value of output is 
E (qn) = E(gh|s) = E(qR) = Gn (41) 
and the variance of output is 
Var'q, — o,m,et o7l(o2 + y2o7)/B7] (42) 
var gn = opm; + osloz + (y — gm,B)? + (gm,)o3]. (43) 
Let us again restrict our attention to the limiting case of perfect 


competition. As before, the optimal value g’ of the reaction coefficient 
will be the one that minimizes var (q?). That is, 


B= Meyig: (44) 


If the activist policy g = g’ is pursued and no monitoring occurs, 
then 


E(q?|s) =4G + (1 — As)ys. (45) 


As in Brainard’s analysis, only the fraction \, of the predicted effect of 
a change in the indicator is offset by optimal monetary policy. In 
contrast to this, (41) implies that all of the predicted effect will be 
offset by the outcome of monitoring. Thus optimal monetary policy 
without monitoring can in this case accomplish more than monitor- 
ing. With private monitoring the real money supply will overreact to 
information about aggregate disturbances. Specifically, it follows that 
the social gain to monitoring when g = g’ is 


Vie ee V2(e = (hi2)\ 0.05111 (82 tos) = 1/871 <= 10, (46) 


Optimal monetary policy can be characterized in this variant model 
as we did in the basic model with results that are similar but with a 
complication, because even when g = g’ and the social gain to 
monitoring is negative, the private gain in this case is actually positive. 
Specifically, 


V8(g) — V?(g) = (h/2)A-o3(1 — Angie’), (47) 


18 See Appendix. 
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from which it follows that V3(¢’) — V2(g') > 0. Thus optimal monetary 
policy may require g to be somewhat larger than g’ so that people will 
not be motivated to monitor. 

For our purposes the important feature of optimal monetary policy 
is that the activist policy (g = g’) will be optimal whenever 


(RQ) Neo l) aNp ye zee ee 2A Oe (48) 
and 
Zq = (RI2)\eo2os1/ B21 (Bee as) Is (49) 


If (48) is satisfied then monitoring will occur under a neutral or 
disseminating policy but not under an activist policy. If (49) also holds 
then the activist policy is optimal because its cost (relative to V?[g’]) is 
Za, whereas, from (46) and the fact that V1(g) is independent of g, the 
cost of each of the other policies will exceed the RHS of (49) by the 
sum of monitoring and administrative costs. As in the previous sec- 
tion, the monetary authority needs no cost advantage for these condi- 
tions to be satisfied if \, < 1. For example, they may be satisfied even 
ifz, >z. + Zp», because the first term in (48) is just A, times the RHS of 
(49). 

Thus, an activist monetary policy can be justified because the mon- 
etary authority, by avoiding the overreaction to the indicator that 
would occur with monitoring, is able to accomplish more than could be 
accomplished by private monitoring. The overreaction of private 
agents monitoring s occurs because each individual sees himself as 
being able to offset the effect of s by changing his relative price, the 
coefficient of which, n, is known with certainty. Thus, he will try to 
offset completely the predicted effects of a change in s. As Brainard 
has shown, this is the optimal policy when the effect of the price 
setter’s action can indeed be predicted with certainty. But the effect of 
every agent trying to alter his relative price is a change in the 
absolute price level, the effect of which cannot be predicted with 
certainty. As Brainard has also shown, the optimal reaction when the 
effect of policy is uncertain is to attempt to offset less than 100 
percent of the predicted effect of the disturbance. The crucial as- 
sumption in this argument is that the degree of uncertainty attached 
to the effects of relative-price changes is different from that attached 
to absolute-price changes. Private agents take the former into account 
but not the latter. In the particular model used above the former 
uncertainty is less than the latter, but this is not essential to the 
argument. In the reverse case private agents would underreact to 
monitored aggregate disturbances, once again giving a rationale to an 
activist policy. 
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VI. Conclusion 


In summary, this paper has argued that an activist monetary policy 
may be justified even when people’s expectations are formed 
rationally if there are costs of gathering and processing information 
and if there is uncertainty concerning the exact values of parameters 
in the economic system. The justification that we have presented is 
twofold: first, that private agents may not be motivated to use infor- 
mation that can be used profitably by the monetary authority and, 
second, that even if they are motivated to use it they may not make as 
good use of it as could the monetary authority. 

The formal argument has been presented with a simple model 
employing specific functional forms chosen for their tractability. 
Therefore, much work remains before the degree of generality of the 
results can be ascertained. 

The first rationale that we provided for an activist policy should be 
distinguished from that of Fischer (1977) and Phelps-Taylor (1977). 
Their models are similar to ours in that prices are set in advance of 
trading rather than being determined by an independent auctioneer 
according to market-clearing conditions. However, their models allow 
activist monetary policies to improve welfare because they postulate a 
temporary rigidity in prices that prevents private agents from realiz- 
ing all mutually advantageous and perceived gains from trade. This 
raises the question of whether the contractual arrangements implicit 
in such models would be competitively sustainable (see, e.g., Barro 
1977). Our argument has gone one step further and shown conditions 
under which these contracts are in fact sustainable. We have not 
postulated any rigidity or imperfection that allows the monetary au- 
thority to react more quickly or with more flexibility to variation in the 
indicator than private agents can. Instead we have shown that private 
agents may choose not to act on as much information as the monetary 
authority possesses, or even as much as the monetary authority makes 
freely available to them, even when it would be Pareto improving for 
them to do so. In other words, this argument also provides an expla- 
nation for the fact that private contracts are typically written in nomi- 
nal terms without being indexed to the money supply or other aggre- 
gate variables. 

The paper has not addressed the question of whether or not the 
particular conditions derived above for the optimality of an activist 
policy actually exist in any real-world situation. The purpose has 
been mainly to argue that as a matter of principle the assumption of 
rational expectations does not imply the uselessness of an activist 
policy. It depends at least on the factors discussed above. Any bal- 
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anced evaluation of alternative monetary policies should also take into 
account at least two other important considerations. First, as Hayek 
(1945) has argued, the most important problem to be resolved is 
probably not what policy to pursue but whom to entrust with the task. 
An activist policy will work only if the authority can be entrusted or 
persuaded to pursue the public interest instead of his own and to 
pursue it competently. Second, as Hayek has also argued, the cen- 
tralization of decision making is most easily rationalized when the 
decisions require the use of general knowledge rather than local, 
specific knowledge. In the model of the present paper every indi- 
vidual agent is essentially identical, so there is no clear distinction 
between these two kinds of knowledge. But more generally one would 
expect the aggregate disturbances to which monetary policy reacts to 
come under the heading of general knowledge, about which most 
individual agents can be expected to have little expertise. Thus a 
further argument in favor of an activist policy is that it allows society 
to centralize, and thus to bring to bear an efficient degree of expertise 
on, decisions concerning such general knowledge. 


Appendix 


This Appendix sketches a formal derivation of the private value of monitor- 
ing. In general terms, suppose that a decision maker has an information set [ 
and his problem is to choose the value of decision variable x so as to 


max E (a + a,x + a,x? |Z) (Al) 
{xr} 
where the a,’s are random parameters. Then the expected value of receiving 
the information contained in the new information set I’ is the expected 
difference in the optimized value of (A1) resulting from replacing / by I’; that 
is, 


E {(E (a, |I)P/4E (a2 |) — LE (a, |Z')P/AE (a | IY}. (A2) 


1. To derive (26) from (A2) note that in this case a, = (q3 + n)(1 + nk), a, = 
=(ni/2)(2 -# nk), 1 = Oy and J’ = f5\. a 

2. To derive (28), let x be the average relative price, PE[P;(M, g,s)~"], and 
note that in this case a, = en(q) + n)(1 + nk), as = —e2(n/2)(2 + nk), I = O, and 
I’ = {s\. 

3. To derive (48) proceed exactly as in deriving (26) and take the limit as n 
> 0, 
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Equalizing Discrimination and Cartel Pricing 
in Transport Rate Regulation 
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There are two possible outcomes of transport regulation: (1) main- 
taining a carrier cartel and (2) imposing equalizing discrimination 
against advantaged and in favor of disadvantaged shippers. Both 
functions have required a complex rate structure to enforce the 
respective forms of price discrimination. Using a sample of freight 
bills from motor carriers and railroads, this paper demonstrates that 
the principal result of motor carrier regulation has been to maintain 
a cartel of truckers, while railroad regulation has thwarted the wishes 
of the railroad cartel by imposing equalizing discrimination on weak 
and strong shippers. Motor carrier rates respond in an economically 
rational manner to costs and shipper bargaining power; rail rates are 
either unresponsive or perversely responsive to the same factors. 
Deregulation should have divergent effects in the two industries. * 


Deregulation is a policy that has been recommended in recent years 
for some but not all regulated industries (see, e.g., Council of Eco- 
nomic Advisers 1979). Candidates for deregulation are most often in 
the transport sector and never in the traditional public utilities such as 
electricity and natural gas distribution. Why is transport regulation con- 
sistently singled out for attention? Also curious are the extraordinary 
attempts of motor carriers to retain the regulatory fetters, while the 
railroads make somewhat half-hearted attempts to remove theirs (see, 
e.g., Williams 1978). What is it about the type of regulation that makes 
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H. Trebing, G. Wilson, P. Schmidt, R. Rasche, D. Suits, S. Tainter, and an anonymous 
referee. 
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one transport industry wish to retain—at all costs—what another 
transport industry is willing to shed? 

As this paper explains, the answer to both questions is found in the 
institution of transport rate making. There are two key differences 
between traditional forms of price regulation and transport rate con- 
trols. First, in most instances, regulation involves category pricing, 
while transport rates are tailored to individual buyers. Second, trans- 
port rates are typically the result of collective rate making, while most 
other regulated prices are not. Together, these aspects allow the 
regulatory authority to bestow advantages and disadvantages on indi- 
vidual communities and industries; they also allow the industry to 
stabilize a sellers’ cartel. 

This paper demonstrates that there would be two distinct results 
from deregulating the motor carrier and railroad industries since 
there are divergent outcomes of regulation in the two modes. Current 
motor carrier rates show classic signs of cartel rate making; railroad 
rates reveal Interstate Commerce Commission (ICC) attempts to 
equalize advantages and disadvantages of shippers and localities. De- 
regulating the trucking industry would inhibit cartel rate making; 
deregulating the railroad industry would lead to a more economically 
rational pattern of prices, with some rates rising and some falling. 

To emphasize the unusual nature of transport pricing, Section I 
shows what category pricing would be like in transport. Section II 
shows how individualized rates and collective rate making lead to 
unpredictable outcomes in the regulatory process. Unlike category 
pricing, transport rate making is ideally suited for either cartel pricing 
or enforcing equalizing discrimination. Section III investigates the 
current motor carrier and railroad rate structures and finds that 
ditferent principles are apparent in the pricing of the two modes. The 
paper concludes with implications of the findings for deregulation 
arguments. 


I. Transport Prices Are Individual Rather than Category Rates 


Compared with motor carriers and railroads, the rate structure for 
most regulated industries is the model of simplicity: The customer’s 
class of service is determined, and the block pricing for the class is 
applied. Such category pricing has two important implications. First, a 
customer’s charges can be raised or lowered only by altering the prices 
assessed everyone in that category. Customers cannot demand special 
price concessions and are protected from unusually high rates by the 
presence of other members in the category. Category pricing prevents 
bargaining over individual charges between suppliers and customers. 

The second implication is for regulators. Because the pricing is 
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relatively simple, the regulatory authorities can make informed deci- 
sions about each price. The costs of providing service to a category of 
customers can, at least in principle, be calculated with some accuracy; 
again in principle, the regulators can check the supplier’s cost calcula- 
tions with a limited staff. 

The costs of providing transport service are quite variable (Harris 
1977; Spady 1979). Thus category prices for railroads and motor 
carriers would be more complicated than the price structures of most 
regulated industries. Such a scheme would, nonetheless, be feasible. 
For example, railroad car rentals (which could change daily according 
to availability) could be separated from haulage charges. Haulage 
charges could be divided into categories depending on the number of 
cars in the train, the regularity of movement, and so forth. Track 
rentals could also be charged and graduated according to expected 
maintenance charges on a link traversed. Congestion tolls could be 
assessed if a shipment used a link with excessive traffic. Separate 
charges could be assessed for using switching locomotives and renting 
yard space. 

Similarly, motor carriers could be compelled to charge separate 
prices for separate services: one price for pickup, another for termi- 
nal sorting, another for use of line-haul service on a certain link, and 
so on. Total transport charges would then be determined by summing 
the costs of the service categories used. 

As long as a transporter could charge for only a limited number of 
service categories, this scheme would mean that a customer’s charges 
could be altered only by changing the rates of all those in a category, 
and that, by limiting the number of prices reviewed by the ICC, a 
competent staff could determine the relation between the price and 
the cost of a service. As demonstrated below, if transport regulation 
had the nature of category pricing, there would be scant reason to 
expect intense political pressure either to preserve or to remove the 
regulatory apparatus. 


II. The Implications of Pricing Services Individually 


Most transport sales are not based on category prices but use a scheme 
of exception or commodity rates. Each price is tailored to the indi- 
vidual buyer (Locklin 1972). As demonstrated below, this has inevita- 
ble consequences for the relations among carrier, customer, and reg- 
ulator. 

A system of class rates for motor carriers and for railroads does 
exist. Under this scheme, each commodity is given a class rating and 
each origin-destination pair is given a rate base number. The trans- 
port price is the product of these two numbers. The class-rate tariff is 


TRANSPORT RATE REGULATION 273 


thus a two-category scheme. It makes pricing simple and thus capable 
of intelligent oversight by the ICC; it also means rates cannot be 
changed for one shipper without simultaneously changing them for 
all others in the same class. 

Because the class-rate scheme is only a two-category system, it can- 
not adequately reflect carrier costs which vary in more than two 
dimensions. In particular, neither economies of traffic density nor the 
likelihood of receiving a backhaul is accounted for in the class-rate 
system. 

These unusual cost circumstances can be accommodated through 
the commodity-rate and exception-rate schemes which allow the 
quotation of a special price for shipping a good defined at greater 
than seven-digit precision between any two stations. That rates can be 
quoted so precisely reveals that they apply in practice to individual 
shippers. Commodity and exception rates are the opposite of category 
rates—they are individual rates. 

Commodity and exception schemes are now almost the universal 
pricing method for rail shipments, and they are the predominant 
form of truckload rates. The class-rate system is reserved for occa- 
sional shippers and for much less-than-truckload traffic. 

To understand the nature of transport rate regulation is to under- 
stand the interaction between the ICC and those parties that propose 
a change in a commodity or exception rate. More than 85 percent of 
motor carrier rate changes are reductions proposed by carriers or, 
more frequently, the rate bureaus to which they belong.! The univer- 
sal rationale is to retain or gain traffic that would otherwise be moved 
by another mcde. 

Under individual pricing, shippers may gain advantages over com- 
petitors if their own rates are lowered but other shippers’ are not. 
To bring this about, a shipper must convince the carrier that either 
(1) existing traffic will be diverted without the lower rate or (2) new, 
formerly uneconomical markets will be opened up if the rate is low- 
ered. A shipper has two hurdles: First, the shippers must convince the 
carriers who would offer the rate that the threat of traffic gain or loss 
is genuine;” second, should the rate be protested, the ICC must rule 


‘This figure was quoted by M. A. Godecker of the Central States Motor Freight 
Bureau. My understanding of regulatory procedure in this paper benefited by discus- 
sions with Godecker and J. R. Ahlstrom of the Western Truck Line Committee. 
Substantial alterations in the procedures for regulating railroads and motor carriers 
were passed by Congress in the time subsequent to the writing of the paper. The forms 
of regulation discussed in this paper were those practiced in through the spring of 
1980. 

* Motor carriers claim that collective rate making is necessary to protect them from 
shippers’ opportunistic misrepresentation of the likelihood of traffic diversion. 
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on it. To go into effect the rate must be declared just, reasonable, and 
nondiscriminatory. 

Precisely what criteria the ICC uses to judge rates have never been 
defined. The commission does demand cost evidence in the form of 
standard cost statements. It recognizes, however, that the costs sum- 
marized in the statements are average costs, and thus additional 
evidence of haulage costs is used; the standard piece of evidence is the 
rate of analogous hauls. 

The basic ICC regulatory procedure assures a balance of rates on 
analogous shipments. A just, reasonable, and nondiscriminatory rate 
is one which is not out of line with rates on analogous hauls. What is in 
practice regulated, then, is the rate relationship among analogous or 
competing hauls. 

The result of this regulatory procedure has been a transport rate 
structure of fantastic complexity: Hundreds of billions of individual 
prices have been established and approved. In contrast, under the 
typical category scheme used in most regulated industries, there are 
rarely more than 100 separate rate categories. The complexity of the 
transport rate structure precludes staff investigations of price-cost 
relationships for individual rates. Indeed, the very complexity of the 
rate structure discourages cost finding. With traffic equal to the sum 
of large numbers of unique movements, virtually all costs will be 
found to be joint or common and thus logically or practically untrace- 
able. 

This inability to cost individual movements has distinct advantages 
both to carriers and to the ICC since it increases the discretion of each 
party to pursue separate goals in the regulatory process. The lack of 
reliable cost figures frees the rate bureaus to engage in demand 
elasticity—based pricing which is implicit in the policy of granting rate 
reductions only to those shippers who can credibly threaten traffic 
diversion. Without cost data, discrimination is extraordinarily difficult 
to prove. The complaint of a firm that lower rates are charged to a 
rival can be answered by the apology that the rate concession was 
necessary to retain traffic. It is, of course, crucial to the proper 
working of the scheme that price-induced traffic diversions to firms 
with identical cost structures be inhibited, otherwise the process of 
collective rate making will evolve toward cost-based prices. 

The difficulty of relating prices to costs gives wider latitude to the 
regulatory authorities to pursue what Hilton (1972) has called “the 
basic behavior of regulatory commissions.” Hilton identified the goal 
of regulation as dissipation of monopoly profits through cross- 
subsidization; a preferable description in the case of transport would 
be requiring discrimination which equalizes the advantages and dis- 
advantages of different purchasers of transport service. The phrase 
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“equalizing discrimination” is preferred to the more familiar title of 
“cross-subsidization” since the latter implies that the carrier uses 
profits from one service to finance another. But if carrier operations 
generally run a deficit, no service is paying for any other. The term 
“equalizing discrimination” emphasizes that the ICC is interested in 
equalizing the conditions of advantaged and disadvantaged shippers 
rather than in having one party pay the bills of another. 

Equalizing discrimination is the natural result of attempting to 
prevent “unjust” or “extractive” discrimination (i.e., pricing to extract 
rents from shippers and localities) without using a cost comparison. 
For example, the volumes generated by a large shipper typically will 
produce both lower carriage costs and many eager carriers, while a 
small shipper will have fewer alternatives and higher carriage costs. 
To prevent exploitation of the small shipper’s lower demand elastic- 
ity, in the absence of a reliable cost comparison, the regulatory au- 
thorities have no option other than to require equal prices for both 
shippers. The burden of this equalizing discrimination naturally falls 
on the advantaged shipper. 

Explicit examples of equalizing discrimination can be found in 
policies that require identical prices for shipment of goods from the 
Midwest to any East Coast port regardless of distance, require low 
rates on California citrus fruit and high rates on Florida produce so 
that both can compete in Northeast markets, and discourage the 
quotation of unit train rates even when there are obvious cost advan- 
tages (MacAvoy and Sloss 1967). 

The pressures on the ICC to practice such equalizing discrimina- 
tion are most intense in the case of relatively low value, homogeneous 
commodities such as agricultural products, coal, steel, and paper. 
Demand for these goods is most sensitive to delivered prices, and 
their profitability is most strongly affected by transport prices. These 
are also the goods carried predominantly by rail. In contrast, the ICC 
gets less intense pressure to use transport prices to equalize the 
conditions of manufacturers of finished goods—the most important 
customers of motor carriers. This is because demand is less sensitive 
to transport prices and because transport rates are a smaller part of 
delivered cost. 

Not only is there less pressure on the ICC to produce equalizing 
discrimination for those commodities hauled primarily by motor car- 
riers, but the commission has less leeway, since carriers can use their 
own trucks if common-carrier rates get too high. One would expect, 
therefore, that the ICC would give more care to regulating rail than 
motor carrier rates: Their range of actions and the participants’ 
interests are greater than in truck rates. 

In the next section the rate structures of the two modes are investi- 
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gated to test the hypothesis that the regulatory influence of the ICC is 
more evident in railroad than in motor carrier pricing. 


III. Empirical Evidence of Different Regulatory Regimes for 
Railroads and Motor Carriers 


To demonstrate the dual outcome of ICC regulation, individual rates 
were regressed on factors which proxy demand elasticities and costs 
of providing service. A cartel unthwarted by the ICC will have rates 
that vary directly with the costs of service and inversely with demand 
elasticities. 

It is a good deal more difficult to specify what would determine 
rates under equalizing discrimination. However, two aspects are likely: 
First, cost savings of a large shipping volume will not be passed along 
to customers in the form of lower charges; and second, having fewer 
shipping options is likely to lower rather than to raise rates since a low 
demand elasticity indicates a vulnerable shipper. 

The complexity of rail and motor carrier rate structures derives 
from the discriminatory aims of both the rate bureaus and the reg- 
ulators. But this same complexity precludes investigating rate struc- 
tures in any manner other than explaining the patterns of actual 
charges for purchased services.’ A sample of rail freight bills has long 
been available from the ICC, but the similar data file for motor 
carriers is the proprietary information of motor carrier rate bureaus.4 
This study was made possible by obtaining the latter file for the year 
1972 and comparing it with the similar rail file. 

The six analyzed commodities (the same for both rail and motor 
carriers) are perhaps not typical of either motor carrier or rail traffic 
as a whole; the two modes do not have identical traffic mixes.> Rather, 
the commodities were chosen to be representative of the goods con- 
tested by both railroads and motor carriers. 


* The use of a sample of actual freight bills to model the rate structure rather than the 
alternative procedure of using rates printed in formal tariff statements is preferred for 
several reasons: (1) There are always numbers of rates available to any shipper, and it is 
not clear which would be the governing rate; (2) charges are sometimes different from 
those printed in tariffs; and (3) a large part of the rate structure is nominal in the sense 
that no freight is ever moved under these rates. 

* The rail data are the ICC “One Percent Waybill Sample of Carload Terminations” 
(U.S. Interstate Commerce Commission 1972). The motor carrier data are from the 
“All-Bureau Continuing Traffic Study for 1972” (Rocky Mountain Motor Freight 
Bureau 1972). 

* The six commodities are: prepared feed NEC, assigned STCC numbers 2042129 
and 2042190 and NMFC number 72910; wrapping paper, STCC 2084 120/21, NMFC 
111510, 2621490, 151800; pulpboard or fiberboard, STCC 2631115/16120, NMFC 
151310; corrugated fiberboard or pulpboard boxes, STCC 2651141, NMFC 29275; 
tires, STCC 3011110, NMFC 157230. Fora complete description of commodity selec- 
tion procedures and data filters, see Boyer (1978). 
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The charges for shipping a hundredweight of each of the six 
commodities between various end points were studied. One of the 
strongest influences was found to be the size of the shipment. In the 
Appendix, I describe the procedure used to adjust the reported 
charge per hundredweight in each freight bill for variations caused by 
differences in loading weight. The dependent variable whose varia- 
tion is explained by the regressions in this section is a least-squares 
forecast of what the charge per hundredweight on a freight bill would 
have been had the shipment been made at a standard size of 340 cwt 
for trucks and 680 cwt for trains. Only boxcar movements of rail 
commodities were considered. 

The independent variables which could be generated from the 
freight bill samples were the length of the haul (M/); the number of 
shipments of this commodity, on this haul, in the sample (SHP); the 
number of alternative destinations open to a shipper of this commod- 
ity from this origin (DST); the number of origins for this commodity 
from which a receiver at this destination can choose (ORN); and 
existence of active competition from the other regulated mode of 
transport (NT). It can be assumed that costs per hundredweight 
increase with distance and decrease with shipping volume. Similarly, 
large values for DST and ORN as well as a positive value for INT 
should indicate a shipper with a higher demand elasticity. In addition, 
the existence of private trucking suggests that shipping volume will be 
positively related to demand elasticities. 

The functional form for these regressions was: 


In (RATE,) = 6,(In MIJ,) + 6,(In MI,)? + 63(In MI,)3 
+ 64M (In MI, — In 100)? + 6;N(In MI, — In 500) 
+ C, In (SHP;) + C, In (DST;,) + Cy In (ORN;) + C,(INT) 


n 
a SdDut Ci, 
j=1 


where 


RATE = cents per hundredweight shipping charges (see 


Appendix); 
MI = length of haul; 
M = 1 if MI, > 100, 0 otherwise; 
N = 1 if M; > 1,000, 0 otherwise; 
SHP = number of freight bills in sample corresponding to 


this haul and commodity; 


® These correspond to truckloads of 17 tons and carloads of 34 tons which were taken 
to be representative in Meyer et al. (1959). 
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DST = number of different destinations for freight bills in 
sample with this origin and commodity; 
ORN =number of different origins for freight bills in 
sample having this destination and commodity; 
INT = 1 if there is a freight bill in both samples with this 
commodity, destination, and origin and 0 if there is 
no intermodal competition; and 
= | if observation is for commodity j and 0 if obser- 
vation is not for commodity 7. 


De 


nN 


The functional form contains a cubic spline of distances with knots 
placed at 200 and 1,000 miles.’ I used this piecewise cubic approxi- 
mation of the distance taper to avoid prior constraints on the shape of 
the distance relationship. 

The regressions in table 1 show the results for the motor carrier 
rate structure. Table 1 also contains two rail regressions, one for the 
complete set of six commodities and one for a restricted set of the 
four goods for which haulage weight/rate relations could be estimated 
(see Appendix). ! 

As reported in table 1, the results strongly support the conjecture 
that motor carriers use a cartel rate structure unhampered by goy- 
ernment interference. Large shippers do get lower rates (the elasticity 
of rates with respect to the number of shipments is —.039) as pre- 
dicted by the cartel model; the cost of serving large shippers is lower 
and the likelihood of switching to private trucking is higher. The 
coefficient on the number of destinations used by the seller (DST) and 
the number of origins used by the receiver of a shipment (ORN) 
similarly confirm that the greater the number of competing hauls, the 
lower the rates. Finally, the coefficient on INT suggests that motor 
carrier rates are approximately 6 percent lower where there is actual 
competition from railroads than where there is not. All of these 
findings are consistent with cartel rate making in the trucking indus- 
try. 

The contrast to the rail results is striking. The response of rail rates 
to larger shipping volume (SHP) is insignificant. A large number of 
destinations (DST) lowers a shipper’s rates, but a larger number of 
origins (ORN) does not similarly lower rates. And, finally, the: pres- 
ence of motor carrier competition (INT) is associated with higher, 
not lower, rail rates. With the exception of the negative sign on DST, 
the remaining results are those predicted by a policy of equalizing 
discrimination where rates are either unresponsive or perversely re- 
sponsive to cost conditions and demand elasticities. The negative 


On estimating spline functions with ordinary least squares, see Suits, Mason, and 
Chan (1978). 


TRANSPORT RATE REGULATION 279 
ABER 


DETERMINANTS OF REGULATED FREIGHT RATES, 
Cusic SPLINE REGRESSIONS 














MorTor RAIL 
CARRIER, 
EQ. (1) Eq. (2) Eq. (3) 

In MI 3.85 (3.71) —17.35 (4.53) —20.64 (4.87) 
(In MI)? amen (ali) 3.16 (.89) 4.11 (.96) 
(In MI)3 .05 = (.04) =) (({D5rsh) aoe (.06) 
(In MI — In 100) —.04 (.06) .22 (.074) Dik (.079) 
(In MI — In 500) 192 (GLO) ge (GI) ai] Eu) 
SHP —.037 (.014) SUI (OI) =F OO SummeOlt2)) 
DST — .0062 (.0042) —.014 (.006) —.014 (.006) 
ORN — OU (11538) .0018 (.0070) —.00031 (.0071) 
INT —.062 (.021) .060 (.020) .064  =(.021) 
Dummy variable: 

3011110 .12  (.0079) 46 (.015) 48 (.015) 

2651141 5200 (06) 51  (.028) Sigal Rs 

2631115 2 ee(-Oii3) .067 (.0090) .09 (.008) 

2621490 Sl (OU Pili ami 70) 15 (.017) 

2084120 el eeOSs) —.11  (.066) Aes ae 

2042190 —.24 (.016) —.61 (.014) — 35 (.014) 
R? 54 82 .83 
N 6,472 4,062 3,836 
F 658.78 904.06 855.56 








Sources.—U.S. Interstate Commerce Commission 1972; Rocky Mountain Motor Freight Bureau 1972. 
Note.—Dependent variable = fitted cents/hundredweight for end-point pairs. Numbers in parentheses are SEs. 


coefficient on DST indicates that there is some rational response to 
demand elasticities as well—an indication of some cartel rate making 
within a structure whose primary characterization is enforcing 
equalizing discrimination. 

The regressions in table 1 give other evidence consistent with the 
hypothesis that railroads are thwarted by regulatory authorities while 
motor Carriers set rates just as a successful trucker’s cartel would wish. 
The first evidence comes from the spline variables on mileage. The 
mileage tapers, expressed on the vertical axis as a percentage of 
charges for a 500-mile haul, are graphed in figure 1.8 The two modes 
appear to be very similar; the rail taper has unusually low short-haul 
rates and relatively high charges for the longest hauls. This is pre- 
cisely the opposite of what would be generated by cost-based rates and 
contradicts the basis for many studies of optimal intermodal traffic 
allocation (Friedlaender 1969; Levin 1978). The slopes and positions 
of the two curves indicate that the observed propensity for long hauls 


® On the safe assumption that rail rates will be lower than road rates at the standard 
500-mile haul, the results suggest that rail and truck rates either diverge as mileage 
increases or that the rail curve cuts the motor carrier rate curve from below. 
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PERCENT OF CHARGES FOR A 500 MILE HAUL 





oS ° So So o So oo 
sesss3s33s3 33:83 83 8 8 88 
SNe eee ee ae eee eS ee eS 
MILEAGE 


Fic. 1.—Mileage tapers for rail and motor carriers. Source: spline coefficients esti- 
mated in table 1. 


to be shipped by rail and short hauls by truck is not the result of 
shippers responding to cost-based rates but, rather, must be due to a 
convergence of service qualities. 

The results are consistent with an ICC policy of “fair sharing 
traffic” among modes. Rail rates are held high for long hauls to allow 
motor carrier participation and are allowed to fall for short hauls 
where motor carriers do not have to fear serious traffic diversions.® 

A final piece of evidence for the differing regulation of motor 
carrier and railroad rates comes from the commodity-specific dummy 
variables in table 1. Converting logarithmic coefficients to percentages 
yields the data in table 2. The numbers are average rates for each of 
the six commodities. The variation in rail rates explainable by com- 


® On fair sharing of traffic, see the “Ingot Molds Case” (American Commercial Lines, 
Inc., et al. v. Louisville and Nashville Railroad Co. et al. 1968). Note that the evidence is 


also consistent with one or the other, but not both, of the modes of pricing under cartel 
rate making. 
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TABLE 2 


VALUE OF SERVICE PRICING 








PERCENTAGE OF CHARGES ON AVERAGE COMMODITY 











Motor Rail 

Carrier 
COMMODITY from Eq. (1) From Eq. (2) From Eq. (3) 
3011110 13.22 159.44 163.02 
2651141 130.31 167.35 fogs 
2631115 74.41 106.97 109.46 
2621490 87.96 114.37 116.72 
2084120 88.58 88.95 ae 
2042190 78.01 53.80 55.49 
SD 21.82 42.90 44.05 





Source.—Table 1, antilog of commodity dummy variables. 


modity value is more than twice as high as that for motor carrier rates: 
The column standard deviations are 42.9 percent for rail as opposed 
to 21.8 percent for truck. This suggests that rail rates are more closely 
related to commodity type than are truck charges.!° This finding is 
consistent with a railroad industry which maintains an obsolete system 
of value-of-service pricing and a trucking industry which has adapted 
to an environment of intermodal competition. 


IV. Conclusion 


The evidence in this paper shows that both motor carrier and railroad 
rates are different from those that would be observed without regula- 
tion. The pattern of rates suggests, however, that regulation of the 
two modes has had different results. Perhaps the strongest support 
for the hypothesis that railroads and motor carriers are subject to 
different types of regulation comes from the public posture of the two 
modes toward deregulation. Within the last years, the Association of 
American Railroads has endorsed a “deregulation” proposal. How- 
ever, the proposal does not suggest that collective rate making be 
abandoned. In contrast, the American Trucking Associations is 
vehement in its opposition to any changes in transport regulations. 
This is consistent with the findings of this paper: (1) Motor carrier 
regulation ratifies a carrier cartel, while (2) rail regulation has 
classic equalizing discrimination as its primary characteristic. 

The removal of government oversight over rates and entry and the 
simultaneous outlawing of collective rate making should have diver- 


10 The simple correlation between cols. a and b is .71. 


282 JOURNAL OF POLITICAL ECONOMY 


gent outcomes in the railroad and motor carrier industries. A de- 
regulated trucking industry would—at least in truckload service— 
develop a rate structure more closely related to costs of service. A 
general reduction in trucking prices would be expected. 

Deregulating rail rates would have far more complex results. Given 
the small number of rail carriers in any market, a strictly cost-based 
rate structure could not be predicted; nor, if rail marginal costs are 
below average costs, would this result be desirable. Instead, my 
analysis suggests that a more rational rail rate structure would result, 
one based both on costs and on the ability to divert traffic (the latter 
related, presumably, to trucking costs). Deregulating rail rates should 
not lead to a general increase or decrease in rail rates but to a 
reshuffling of prices. 

This expectation is confirmed by the recent removal of ICC over- 
sight over fruit and vegetable tariffs. The diversion of traffic to 
unregulated truckers over the last 2 decades effectively defeated 
the attempt at equalizing discrimination, and in 1979 rail rates on this 
traffic were deregulated. The result was a simultaneous increase of 
some rates and a sizable diversion of traffic to railroads in response to 
lower rates for other hauls (Association of American Railroads 1979; 
American Trucking Associations 1979). There is every reason to 
expect that deregulation of rail rates for other commodities would 
have a similar effect. 

The disparate effects of regulation in the two modes are the direct 
result of the institution of pricing services individually rather than by 
categories. Individual pricing, which has allowed the ICC to pursue a 
policy of equalizing discrimination, has also required the use of rate 
bureaus and collective rate making. Had the transport industries used 
category rate making, regulation to enforce equalizing discrimination 
would have been hampered and collective rate making would have 
been unnecessary, and, therefore, it is doubtful that the fervent calls 
for and against deregulation would be heard today. 


Appendix 


The regressions in the text explain variations in fitted or standard rates for 
each commodity-origin-destination combination. These rates are adjusted for 
differences caused by variable loading weight among different commodities. 

While most discussions of transport pricing proceed on the assumption that 
there is a single charge per unit of service, in fact this is not the case. There 
are separate charges per hundredweight for shipments of a commodity be- 
tween two end points for (a) minimum charge, (5) less-than-truckload (LTL), 
(c) truckload (TL) or carload, and (d) very heavy shipments. Due to the 
different charging regimes there is an inverse relationship, “a weight taper,” 
between charges per hundredweight and loading weight. To develop a 
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structure of single rates for commodities between end points, the first step 
was to estimate a weight taper for each commodity. 

The weight tapers were estimated on the basic assumption that for every 
origin-destination pair, the shipping charges per hundredweight for a ship- 
ment of one size would be proportionate across different levels to the charges 
for a shipment of a standard vehicle load of the commodity. The standard 
procedure of spline estimation was used to estimate the shape of the taper in 
the absence of prior knowledge of its functional form. Knots were arbitrarily 
placed at 100 and 300 cwt for motor carrier estimation and at 500 and 1,000 
cwt for railroads. 

The functional form used to measure the rate taper with respect to loading 
weight for each commodity was: 


In cents; = b, (In wt;) + 6, (In wt;)? + 63 (In wt)? + b,F (In wt; — Inf 


+ b;G (In wt; — Ing)? + >» c;Dy + e:, 
j=l 
where 


cents; = cents per hundredweight for shipment; 

wt; weight of shipment in hundredweight; 

Dj = haul dummies: D;; = 1 if the shipment is on end-point pair, 
Dx = 0 if the shipment is not on end-point pair j; 

F 1 if wt; > 100 for motor carrier or wt; > 500 for railroads, 0 
otherwise; 

7 = 100 for motor carriers, 500 for rail; 

G 1 if wt; > 300 for motor carrier or wt; > 1,000 for rail- 
roads, 0 otherwise; 

g = 300 for motor carriers, 1,000 for rail; and 

ei = error term. 


The values of the spline variables are listed in tables Al and A2 for rail and 
motor Carriers, respectively. The number of dummy variables is listed in each 
table as the number of hauls. For two commodities, there was insufficient 
variation in the independent variable to estimate the relationships for rail- 
roads. 

In general, the tapers have at least one steeply downward-sloping seg- 
ment which for motor carriers can be interpreted as the transition from LTL 
rates to TL rates. As should be expected from commodities that differ in load- 
ing characteristics, this steep downward-sloping section occurs at different 
weights for different commodities. 

Some commodities have flat motor carrier tapers beyond the first down- 
ward-sloping segment, while others continue to fall; still others havea slightly 
upward-sloping segment at the very end. Since spline functions are piece- 
wise cubic, it is not clear how much faith should be placed in the aberrant 
behavior of the tapers of some commodities for very large and very small 
shipment sizes. ee 

The downward-sloping segment of the weight taper for rail-shipped com- 
modities is found in most cases at much larger shipment sizes than for motor 
carriers. For most rail commodities, the downward slope is not as steep as for 
trucks. As must be expected for commodities with different loading charac- 
teristics, there are exceptions to this rule. Without exception, however, the 
downward slopes for rail commodities are at a much higher range than for 


truck commodities. 
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TABLE A2 


RAIL WEIGHT TAPER REGRESSIONS 








COMMODITY 








2621490 2631115 
INDEPENDENT 2042190 Wrapping Pulpboard or 3011110 
VARIABLE Feed Paper Fiberboard Tires 
b, 172.48 SS LOSEAS — 128.61 -7,.47 
by —30.02 17.61 lel 1.10 
bs 2 oo leo =.09 
bs Sol 1.85 1.62 Owl 
bs Solel 1.69 Oe 250.37 
R? .963 -998 .980 .979 
ie .08 .90 378 296 
Observations 1,187 562 3,159 999 
Hauls 890 432 1,941 577 
F 5.38 231 147.63 35.12 





Source.— Interstate Commerce Commission 1972. 
Nore.—Dependent variable = In (cents per hundredweight). Equations for wine and boxes could not be estimated 
due to insufficient variation in the independent variables. 


Two coefficients of multiple determination are reported for each equation. 
As usual, R* is interpreted as the percentage of the total variation in the 
dependent variable which can be explained by the right-hand-side variable. 
This is misleading, however, since the extensive use of dummy variables 
accounts for much of the apparent explanatory power of the regressions. The 
second coefficient of multiple determination, 72, is defined as: 


SS oy - Fy 


i=1 j=1 
= 





nj ni om 


OM (Yan — Y,) = 2) (a Wey 


i=1 j=1 i=1 j=1 


> 


where Y;; is the 7th observation on the dependent variable in group j, ¥; is the 
mean of the dependent variable for observations in group j, and YF, is the 
fitted value of the ith observation on the dependent variable in group J. 
Numbers in the row marked 7? are thus a more accurate indication of the 
explanatory power of the five spline variables. The reported F-statistic is 
computed using 7”. 

The dummy variables in each equation were computed using the analysis of 
covariance. They were evaluated, however, not at 0 cwt but at 340 cwt for 
trucks and 680 cwt for trains. The dummy variables thus are interpreted as a 
standard rate for each haul and commodity: a least-squares forecast of the 
rate had all shipments been made at the standard size. The standard rate thus 
purges all influence of variable loading weights within a commodity. For the 
two rail commodities whose weight tapers could not be estimated, the stan- 
dard rate was computed as the average charge for the haul, regardless of 
loading weight. 
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By considering investor order placement strategy, this paper dem- 
onstrates that transaction costs cause bid-ask spreads to be an equi- 
librium property of asset markets. With transaction costs, the prob- 
ability of a limit order executing does not go to unity as the order is 
placed infinitesimally close to a counterpart market quote; thus, with 
certainty of execution at the counterpart market quote, a “gravita- 
tional pull” is generated that keeps counterpart quotes from being 
placed infinitesimally close to each other. An equilibrium spread is 
defined and its size linked to market thinness; implications are noted 
for the design of a trading system. 


I. Introduction 


This paper establishes that transaction costs in secondary asset mar- 
kets cause individual investors to use order placement strategies that 
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result in a nontrivial market bid-ask spread.’ We define an equilibrium 
market spread and demonstrate that it will be greater for thinner 
securities. 

The analysis fits into a growing body of literature which increas- 
ingly is being referred to as the microstructure of security markets. 
Stigler (1964), Demsetz (1968), West and Tinic (1971), Tinic (1972), 
and Tinic and West (1972, 1974), along with Farrar, Smidt, Stoll, and 
others involved in the institutional investor study (see U.S. Securi- 
ties and Exchange Commission 1971), were among the first to focus 
rigorous analytical attention on the operations of security markets. 
More recently, microstructure theory has been explicitly considered 
by, among others, Garman, who coined the term (1976), Beja and 
Hakansson (1977), and Cohen, Maier, Schwartz, and Whitcomb 
(hereinafter CMSW) (1978). The analytical issues addressed involve 
the interplay among market participants, trading mechanisms, and 
the dynamic behavior of security prices. In the present paper, we 
study the formulation of optimal investor trading strategies and how 
these interact with one aspect of the dynamic behavior of security 
prices, the bid-ask spread. Spreads are of concern to investors because 
they are a variable cost of trading by market order and because they 
cause an inflation of transaction-to-transaction returns variance. 

The pioneering analysis of bid-ask spreads was provided by Dem- 
setz (1968). Further studies include: Tinic (1972), Tinic and West 
(1972, 1974), Benston and Hagerman (1974), Garman (1976), Ham- 
ilton (1976, 1978), Branch and Freed (1977), Stoll (1978a, 19786), 
Ho and Stoll (1979, 1980), Newton and Quandt (1979), Schleef 
and Mildenstein (1979), Smidt (1979), and Amihud and Mendel- 
son (1980). In CMSW (1979) we discuss this literature and contrast 
it with the formulation presented here. Except for Ho and Stoll 
(1980), there has been no explicit consideration of the transition from 
individual spreads to the market spread. It will be clear from our 
analysis that this transition is not a simple aggregation process and 
that the market spread is the product of a dynamic interaction in- 
volving many market participants. Also, previous theoretical models 
have generally assumed that investors can be dichotomized into two 
groups—immediacy demanders and immediacy suppliers. Our 
model of investor order placement strategy suggests that such a 
dichotomy will not be observed in the marketplace. 

For the market, the spread is the difference between the lowest ask 
and the highest bid of all participants. In markets composed of many 


* We have elsewhere (Cohen et al. 1980) considered how the impact of transaction 
costs on stock price movements introduces serial correlation in returns data and causes 
estimates of the market model beta coefficient to be biased. 
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traders with heterogeneous beliefs and trading propensities, one 
might expect to have orders at virtually every permissible price in the 
neighborhood of equilibrium and hence to find no significant market 
spread.’ However, we show that even when expectations and trading 
propensities are heterogeneous, the spread is a property of asset 
markets that have temporarily cleared. The analysis yields an exis- 
tence proof of a noninfinitesimal spread with continuous pricing; a 
fortiori this proves for discrete prices the existence of a positive 
spread for nontrivial reasons.‘ 

The essence of our argument is as follows. At any point in time, any 
investor might alternatively seek to trade via a limit order (be an 
immediacy supplier), trade with certainty via a market order (be an 
immediacy demander), or not seek to trade at all. Limit orders create 
the book, and market orders clear out limit orders. Because execution 
via market order is certain, while execution via limit order is not, it 
never pays for any investor to place a limit order (e.g., a bid) at a price 
too close to that of a counterpart limit order (e.g., an ask). Intuitively 
stated, as a trader contemplates placing a bid closer and closer to an 
ask already established on the market, he is increasingly attracted by 
this counterpart offer; at some point, the “gravitational pull” exerted 
by the established ask will dominate. The trader will “jump” his price 
and execute with certainty via a market order. 

Section II establishes the scenario for our analysis. Section III 
focuses on the probability of a limit order executing and shows that, 
with transaction costs, this probability does not go to unity as the price 
at which a limit order is placed becomes infinitesimally close to a 
counterpart market quote. This demonstration underscores an inves- 
tor’s need for an order placement strategy and provides the founda- 
tion for our gravitational pull model. Section IV models the investor’s 
order placement decision and develops conditions under which he 
will transmit limit or, alternatively, market orders to the market, or do 
nothing. The analysis is developed in a dynamic programming 
framework, although we are interested in the descriptive modeling of 


? When an asset market has cleared, there is neither excess demand nor excess supply 
in the sense that at that moment no market participant is willing to buy the asset at a 
price equal to or greater than the ask, and no one is willing to sell at a price equal to or 
less than the bid. Hence we must have a market spread that is at least equal to the 
minimum allowable price change for the asset in question (on the major U.S. stock 
exchanges, this is 1/8 of a dollar for most common stocks). However, it is not obvious 
why spreads greater than minimum allowable price changes are commonly observed. 

3 While the analysis presented here is applicable to any asset market, our formal 
model treats the secondary market for financial securities. Security markets have two 
convenient properties: All units of an asset are identical, and such markets are imper- 
sonal (which means that bargaining, as distinct from trading, strategies need not be 
considered), 

4See n. 2 above. 
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an investor’s decision process rather than in actually generating a 
normative solution. Section V demonstrates that implementation of 
the strategic order placement decision (which implies the gravitational 
pull effect) causes a noninfinitesimal bid-ask spread to exist. This 
section also defines an equilibrium bid-ask spread, discusses condi- 
tions under which it will exist, and shows that it is positively related to 
market thinness. Section VI considers implications for the design of a 
security market trading system and summarizes our analysis. 


II. The Scenario 


Consider an investor who maximizes the expected utility of terminal 
wealth and, for simplicity, let him allocate funds between only two 
assets: a risk asset and cash (which we take to be the numeraire asset). 
In the absence of transaction costs, the market would be monitored 
continuously and appropriate transactions would be made with each 
change in the market price and the investor’s demand propensities. 
Then, if price were continuous, there would be no spread and the 
market price would be determined by a straightforward aggregation 
of individual demand propensities. 

However, a variety of transaction costs impact on the investor’s 
trading decisions. The fixed (with respect to number of shares traded) 
costs of assessing information, monitoring the market, and conveying 
orders to the market imply that the investor will make trading deci- 
sions only periodically. Further, when decisions are made, he will not 
convey his full set of demand propensities to the market. For one 
thing, trades that involve sufficiently small portfolio adjustments 
would not justify the transaction costs incurred.® Also, attempts to 
transmit several limit orders simultaneously would be likely to over- 
load our current system. Furthermore, a continuous auction which 
does not generate a Walrasian solution cannot readily handle multiple 
buy-sell orders that, ex ante, are intended to be alternatives.® 

In light of transaction costs (and also taking account of the timing 
and magnitude of exogenous cash flows), the investor will establish a 
discrete set of decision points. In the analysis presented in Section IV, 
we take the frequency of these points as predetermined. Upon 
reaching a decision point, an investor can do nothing, or hit an 


° It can be readily demonstrated that variable as well as fixed transaction costs make 
sufficiently small portfolio adjustments prohibitively expensive. 

§ That is, if any array of buy and sell orders from one investor is executed sequen- 
tially, the desired quantities at each price should be dependent on the exact sequence of 
purchases and sales followed. However, the investor does not know ex ante which 
specific sequence will occur. This problem could, of course, be handled (at a cost) by 
conditioning orders on the sequence of prior transactions (i.e., if the limit order to buy 
200 at $50 executes, then sell 100 at $56). 
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existing limit order with his own market order, or place his own limit 
order at a “better” price and run the risk of its not executing. A 
concurrent strategy issue also exists when the investor finds it pro- 
hibitively expensive to convey his entire set of demand propensities to 
the market; for all intents and purposes, he must select the single best 
alternative to transmit. 


III. The Probability of a Limit Order Executing 


In this section, we establish the conditions under which the probability 
of execution does not approach a limit of unity as the price at which 
the limit order is placed is taken to be infinitesimally close to the 
counterpart market quote. Under these conditions we obtain a prob- 
ability jump (at the counterpart market quote) that underlies the 
gravitational pull effect developed in Section V. By showing that the 
probability jump can be attributed to the existence of transaction 
costs, we establish the basic linkage between spreads and transaction 
costs. 

Consider the case where an investor contemplates submitting a limit 
bid at the price P/¥ at time t, and let price be a continuous variable. A 
similar argument can be constructed for the case of a limit ask. As- 
sume that if the limit order is unfilled by the next decision point at 
time ¢ + 1 the order will be canceled. Let L be the length of time 
between decision points ¢t and t¢ + 1. 

Let Pi4 be the market ask price at time ¢. Consistent with the 
random walk version of the efficient markets hypothesis, we make the 
Markov assumption that each subsequent market ask depends only on 
the last previous market ask. If we also assume that each change in the 
natural log of the market ask, Z;, is a random variable that is indepen- 
dently and identically distributed over time with mean zero and vari- 
ance var (Z;), then we can model the market ask price generation 
process as a compound Poisson stochastic process:’ 


7 For expositional simplicity we assume that the stochastic processes considered in this 
section have no drift (that is, their expected value is zero). It should be noted that even 
though In P¥4 is assumed to have no drift, the price series itself, P?“, may have drift. 
For example, when In (P/{4/P#"4) is normal, with mean zero and standard deviation o, 
then P#4/P¥4 is log normal with mean exp (4 o”). 

There will be a realization of the random variable Z; at each point of time when any 
one of the following events occurs to affect the specific limit order which sets the market 
ask: (a) it is withdrawn; (b) it executes against a crossing buy order; or (c) it remains on 
the book but is no longer the market ask since a lower limit sell has been submitted. 
Note that events of types a and 6 necessarily result in a change in the market ask 
when price is continuous and utility functions are heterogeneous (which implies a zero 
probability of having two or more orders at a specific price). Also note that only events 
of type b are associated with transactions; hence the number of Z; which materialize 
during any interval will usually exceed the number of transactions in that interval. 
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N(A) 
eA Ae eae (1) 
i=1 
where A is the time from the last decision point. When A equals L, we 
have P#4(A) = PMA; when A equals 2L, we have P#4(A) = P/#4, etc. The 
number of changes in the market ask that take place in the time 
interval A is N(A). We assume N(A) follows a Poisson process with 
arrival rate v. 

The next step is to determine the probability of execution (during a 
time interval of length L) of a limit bid which is submitted at a price 
P?® greater than the current market bid (P¥*) but less than the 
current market ask (P?”4). Clearly, as the limit bid price approaches 
(from below) the market ask, the probability of execution increases. 
One might suppose that this probability approaches unity as the limit 
bid approaches the market ask; however, this need not be the case. In 
the Appendix we restate formally and prove: 


Proposition 1: If P/ is generated by the compound Poisson pro- 
cess of equation (1), then no matter how close the limit bid approaches 
(from below) the market ask, the probability of the limit order 
executing is less than unity in any time interval of finite length. 


Since a market order will always execute with probability one, propo- 
sition 1 gives a probability jump at the market ask. 

Transaction costs are crucial to the existence of the probability 
Jump. In the absence of transaction costs, one might expect that, 
following the work of Merton (1973), the logarithm of the market ask 
would best be described by a Wiener process with zero drift: 


d In P##4(A) = adZ (A), (2) 


where @ is the instantaneous variance of the process and dZ(A) is a 
standardized Wiener process. In this case, the price P/4(A) would 
experience an infinite number of adjustments in the interval 0 < A < 
L. In the Appendix we restate formally and prove: 


PROPOSITION 2: If P/#4(A) is generated by the Wiener process of 
equation (2), then as the limit bid approaches (from below) the market 
ask, the probability of the limit order executing approaches unity for 
all time intervals of the finite length. 


Proposition 2 implies that for the Wiener process there will be no 
probability jump at the market ask. 

We next consider whether there is a relationship between the com- 
pound Poisson process of equation (1) and the Wiener process of 
equation (2). In the Appendix we restate formally and prove: 
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Proposition 3: If the random variable Z; in the compound Poisson 
process of equation (1) has two equally likely possible values, +a@ and 
—a, and if the arrival rate v of this compound Poisson process 1n- 
creases without bound while a simultaneously decreases in such a way 
that ve’ is constant, then the compound Poisson process approaches 
the Wiener process described by equation (2) with the variance of the 
Wiener process, o?, equaling va?. 


Proposition 3 states that the compound Poisson process can be ex- 
pected to approach the Wiener process under appropriate assump- 
tions.® Thus the probability jump of proposition 1 would disappear as 
v increased and var (Z;) decreased. In the Appendix we restate for- 
mally and prove: 


PROPOSITION 4: As the arrival rate v in the compound Poisson process 
of equation (1) increases, the probability of a limit order executing 
increases for all P}? < PMA, 


Hence the probability of a limit order executing increases at all P}# 
< P#4 as the activity proxy v increases. Stated conversely, the “thin- 
ner” a security, that is, the less active are investors in submitting 
orders to trade in the security, the lower will be the probability of 
execution at each P/? < P¥4, and therefore the greater will be the 
probability jump at P¥4, 

The four propositions stated in this section (and proved in the 
Appendix) have important implications for the analysis of the impact 
of transaction costs on bid-ask spreads. Without transaction costs, the 
market price could be expected to behave as a Wiener process; that is, 
there would be an infinite number of infinitesimally small price ad- 
justments. The investor who was considering placing a limit order 
could reduce the probability of his order not executing to as close to 
zero as desired simply by placing his order close enough to the 
counterpart market quotation. 

On the other hand, with transaction costs and a finite number of 
investors, the market price would generally not behave as a Wiener 
process. Investors would find continuous adjustments in their 
portfolios too expensive, and market prices would behave as a 
stochastic jump process (proposition 3 shows that this process could 
generally be expected to approach a Wiener process as the order 
arrival rate is increased). One such jump process is the compound 
Poisson, and it also is consistent with a (martingale) efficient market. 


8 The particular distribution chosen for Z; in proposition 3 is not critical. Other 
appropriately chosen discrete or continuous distributions can also be shown in the limit 
to go to the Wiener process. 
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Proposition 1 demonstrates that for such a stochastic jump process a 
probability jump exists. Proposition 4 has shown that the probability 
jump will be greater for thinner securities. In Section V we will show 
that the larger probability jump for thinner issues leads to equilibrium 
market spreads which are larger for thinner issues. 


IV. A Model of Investor Order Placement Strategy 


We now consider the question of when an investor will choose to trade 
via a market order or limit order, or not seek to trade at all. The 
problem is structured as follows. Because of the costs of monitoring 
the market, let the investor consider rebalancing his portfolio only at 
preselected points in tame,¢= 17275... 2 — 1, where 7-15 tne 
investment horizon.® In order to simplify the analysis, we now con- 
sider the placement of only purchase orders for the risk asset (omit- 
ting the symmetric case of sell orders without loss of generality). 

At any of the T — 1 decision points, the investor is faced with three 
possible courses of action: 

a) Submit a market order to buy shares at the current market ask 
price of P¥4, 

b) Submit a limit order to buy shares at a limit bid price of P}#? < 
Eee 

c) Do nothing. 

In modeling the investor’s choice among these three alternatives, 
we find it convenient to make the following assumptions that simplify 
the analysis without materially affecting the nature of the conclusions. 
We assume that all orders are for a fixed number of shares AN and 
that when any market or limit order is executed, it is satisfied fully at 
the stated price; this avoids the tedium of writing (average) transaction 
price and transaction costs as functions of the number of shares 
exchanged and of defining probabilities of partial execution. We as- 
sume that unfilled limit orders are canceled prior to the next deci- 
sion point; this avoids both the need to include additional state vari- 
ables (the price and quantity of any limit order outstanding) and the 
need to analyze additional courses of action (submit a market order 
and leave the old limit order outstanding, or submit a market order 
and remove the limit order, etc.). Finally, we assume no lags in the 
transmission of information and orders; this avoids the complexity of 
dealing with changes in the current market quotes during the time the 
investor formulates and implements his decision. 


* Note that the rebalancing points need not be the same for all investors, so that 
trades can occur at any time when the market is open even if specific traders are not 
continually in the market. 
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If option a is chosen, the market order will be executed, and the 
investor’s holdings at time t + 1 become N,,, = N,; + AN, for which the 
investor pays a total cost of AN - P@4 + C™ where C” is the total cost of 
transmitting and executing the market order. 

If option 6 is chosen, then one of two events can take place: (b1) 
The limit order is executed. The investor’s share holdings then be- 
come N;4, = N; + AN, for which the investor pays a total cost of AN - 
Pi? + C* + C” where C”" is the cost of transmitting a limit order to 
the market and C” is the cost of executing a limit order. (62) The limit 
order does not execute and is canceled prior to the next decision 
point. The investor’s share holdings then remain unchanged (N,4, = 
N,) and his cash is decreased by the cost of transmitting the limit 
order, C1. Option b will be chosen over a if the gain associated with 
the possibility of trading at a more favorable price outweighs the loss 
associated with the probability of not trading at all. 

The investor must consider four subjective probability distributions 
in order to make an optimal decision. These are: (1) the joint prob- 
ability distribution of market bid and ask prices at time ¢ + 1, condi- 
tional upon the quotes at time t; (2) the probability of a limit bid order 
submitted at time ¢ executing before time ¢ + 1; (3) and (4) the joint 
probability distribution of market bid and ask prices at time ¢t + 1, 
conditional upon the quotes at time ¢ and further conditional on 
whether a limit bid submitted at time ¢ either did or else did not 
execute prior to time?¢ + 1. In the Appendix we discuss in more detail 
these four subjective probability distributions (only three of which are 
independent). 

A dynamic programming model of the investor’s choice among 
options a, b, and c is formulated in the Appendix. The investor is 
assumed to maximize his expected utility of terminal wealth, max (U,, 
U,, U3), where U, is the expected terminal utility of choosing option 
a, trading via a market order; U, is the expected terminal utility of 
choosing option 5, seeking to trade via a limit order; and U; is the 
expected terminal utility of choosing option c, doing nothing. It is 
convenient to focus on the utility gain, AU, or AU», which results from 
choosing option a or 6 rather than option c: AU, = U, — Us; 
AU,(P/4) = U,(P}) — U, (note that U, and AU, are functions of the 
limit bid price). Clearly AU,(P/2 = P¥4) = AU,, since, at this price, the 
market order and limit order strategies are effectively the same (the 
probability is, unity of the limit order executing at a price of P}M4). 

Let us now consider the conditions under which U, > max (U2, Us), 
in which case the investor will submit a market order, or U, > max 
(U,, U3), in which case the investor will submit a limit order. Suppose 
that at the current market quotes the do-nothing strategy is domi- 
nated (i.e., max [U,, U,] > U3). Given our utility gains AU, and 
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AU,(P#4) and the probability function for limit order execution de- 
veloped in Section III above, we know the utility gain from placing a 
market order and can readily obtain an expected utility gain function 
for the limit order strategy. These are depicted in figure 1; the shape 
of the function is explained as follows: 

1. While the utility of a consummated trade decreases monotoni- 
cally with P}, the probability of execution increases with P7”, with two 
probability jumps (at the market bid and the market ask). The jump at 
P¥B8 simply reflects the institutional reality that orders placed at prices 
less than or equal to P#® would not have priority over the current 
market bid, whereas an order placed at a price above P/? would be 
at the top of the limit order book. The jump at P?4 follows from 
proposition 1. 

2. Since the probability of execution is constant to the right of P4, 
the expected utility gain has a peak at P“4, corresponding to the utility 
gain of transacting by a market order. 

3. The probability of execution increases between P¥? and P™4, 
with the greatest relative increase just to the right of P#8. We expect 
this large probability increase in this neighborhood because the strat- 
egy considerations of other investors might lead them to place limit 
orders just to the right of P/’? in order to capture the largest price 
advantage. Hence the second peak in the expected utility gain func- 
tion will occur at some point P", between P¥8 and P™4, 

4. The probability of execution decreases rapidly in a neighbor- 
hood just to the left of P/® because of existing limit orders on the 
book. However, since there would be a clustering of limit orders near 
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P83, we would expect the probability of execution to decrease more 
slowly as the limit price, P/?, moves further from P¥2. Hence, we 
might find a third peak in the expected utility gain function to occur 
at some point P’ to the left of P¥8, 

Which of the three peaks in AU,(P#) will dominate depends on 
the particular form of the probability function for limit order execu- 
tion as well as the expected utility function. In the particular diagram 
presented in figure 1A, AU,(P/*) has positive values (hence a do- 
nothing strategy is dominated), has values greater than AU, (hence 
the market order strategy is dominated), and gives a global maximum 
at P”. Assuming only one limit order per investor, the optimal strategy 
is to place a limit bid at P”, and hence P” will become the new market 
bid. The setting of such a bid quotation can be considered a Stigler 
(1961)-type search activity.'® 

Clearly the limit order strategy need not always be superior; in 
selecting this strategy, the investor also accepts the chance that the 
limit order may fail to execute, in which case the investor would be 
worse off than had he done nothing since he would have also lost the 
cost of placing the limit order, C1. If the probability of failing to 
execute is high enough and AU, > 0, the investor will prefer the 
market order strategy as illustrated in figure 1B.1! 


V. The Market Spread 


It is clear from the preceding analysis (Sec. IV) that, with a 
continuous auction market, each investor’s order placement decision 
is made with reference to prices already established on the mar- 
ket—the bid and ask quotations which define the market spread. In 
turn, each investor’s order may affect the market spread to which 
subsequent traders react. Hence, the market spread is the product of 
a dynamic interactive process. In this section we show that a nonzero 
market spread must exist, define the equilibrium market spread, and 
relate a security’s equilibrium spread to its thinness. 

The limit order book comprises the limit orders transmitted to the 
market by a subset of the many traders in a security, and the spread is 
essentially a gap in the limit order book. Therefore, having estab- 


10 By the placement of such a limit order, a seller or buyer announces his propensity 
to trade with the hope of getting the attention of a counterpart market participant who 
would also be willing to trade. ; 

11 There are two other versions of fig. 1. One would be analogous to panel 4, but with 
the global maximum at P’ rather than at P"; in this case, the investor would submit a 
limit bid at P’ (below the market bid). The second version would be where the zero 
point on the utility gain axis is higher than both max AU,(P/*) and AU; in this case, the 
investor would do nothing. 
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lished that individual investors will sometimes seek to trade via limit 
orders, we must show why a noninfinitesimal gap in the array of such 
orders exists between the market quotes. We do so by considering the 
impact that the jump in the probability of a limit order executing (Sec. 
III) has on the investor’s optimal order placement strategy (Sec. IV). 

We have established the conditions under which a limit order will be 
placed; we can now consider whether or not limit orders will neces- 
sarily be placed so as always to preserve a nonzero bid-ask spread. By 
continuity of the utility function, and because of the discontinuous 
drop in the probability of execution that must occur at the market ask 
as we move to lower prices, we must have a discontinuous drop in 
AU,(P}*) as we move past the market ask to lower prices. Then, 
because of continuity of the probability function until we get to the 
market bid (at which point another discontinuous drop occurs), it 
necessarily follows that AU,(P}/*) cannot exceed AU, within a nonzero 
neighborhood to the left of P/4. Hence, no limit bid will be placed 
infinitesimally close to the market ask, and therefore we must have a 
nonzero bid-ask spread. 

Intuitively viewed, consider an investor who is thinking about 
whether or not to place a limit bid at some price below the market ask. 
As the potential limit price rises toward the market ask, it becomes 
relatively more attractive to transact at the market ask with certainty. 
At some point, the investor will prefer buying at a discretely higher, 
unitary probability of execution. In other words, certainty of execu- 
tion at the market ask creates a kind of gravitational pull which causes 
investors to jump their price once their potential limit bid gets close 
enough to the market ask. This clears out limit bids when they get too 
close to the market ask, thereby leaving a nonzero spread between the 
market bid and ask. 

We next consider the issue of an equilibrium market spread. 


DEFINITION: In a dynamic trading process we define the equilib- 
rium market spread as the bid-ask spread at which, for the next instant 
of time, the probability of the spread increasing is equal to the prob- 
ability of the spread decreasing.” 


This definition does not imply that if away from equilibrium the 


_ ™ Note that the probability is not conditioned on the current status of individual 
Investors, recent prices, or other market conditions. 

Note that we use the term “equilibrium market spread” to refer to one particular 
point in the probability distribution of the market spread, rather than to refer to a 
statistic for the central tendency of the entire distribution. We do not adopt a central 
tendency definition (such as expected value, mode, or median) of the equilibrium 
spread because it does not provide a ready link to the market forces that generate the 
spread and to thinness. For empirical research, the average market spread is likely to be 
a reasonable proxy for our definition of the equilibrium market spread. 
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spread will necessarily move toward it on the next order, but only that 
it is more likely to do so than not. 

We assume that a unique equilibrium exists. To examine the rea- 
sonableness of this assumption, first consider an arbitrarily small 
spread. Because the probability of a limit order being placed has a 
discrete jump at P¥4, we have already argued that there will be no 
limit orders placed in some nonzero neighborhood below P™4 by all 
investors seeking to buy the security. (The generalization from a 
single investor to all investors takes place simply by choosing the 
smallest neighborhood found among all investors.) Therefore, for a 
sufficiently small bid-ask spread, the unconditional probability of the 
spread increasing must exceed the unconditional probability of the 
spread decreasing as analyzed from the standpoint of the buyer. A 
similar argument holds for the seller. 

Now consider what occurs when the spread widens. The larger the 
spread is, the greater the potential utility gain from an optimally 
priced limit order, while the utility gain from a market order will 
remain unchanged or fall. The gain in utility occurs because the 
shares can be bought (sold) at a lower (higher) price via limit order 
than by market order.'* With the rise in utility U,(P#*), investors will 
begin to shift their preferences from market orders to limit orders. As 
more and more investors shift to limit orders, the probability of the 
spread increasing falls, while the probability of it decreasing rises. 

One of two situations must occur. Either the spread would tend to 
reach a point at which the two probabilities would be equal, in which 
case it would be the equilibrium market spread, or the limit order 
book would be empty on one or both sides.'* Since when the order 
book is empty market orders cannot transact, we would be at a point 
where the probability of a limit order is greater than that of a market 
order. Although such a point might be achieved without the equilib- 
rium market spread ever being attained, we believe it would be 
atypical.’® 


13 We assume that the spread does not impart any information to the investor on the 
direction of future price movements. 

4 No other possibility exists. So long as there is a greater probability that the spread 
will increase than that it will decrease, the spread will on expectation grow—limited 
only by the collapse of the limit order book on one side or the other. _ 

15 With regard to the assumption of uniqueness, investors might have utility functions 
which could lead to more than one equilibrium market spread. Consider the case where 
an investor views a very wide spread as signaling the advent of new information which 
may cause a price adjustment of unknown direction. The investor sensing greater risk 
might now choose a do-nothing strategy over placing either a limit or market order. 
This could support a very wide spread as a new market equilibrium spread. Of course, 
investors would eventually conclude that the new information was not forthcoming, 
and trading would resume at its former frequency with the old equilibrium spread 
reestablished. 
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Our definition of the equilibrium market spread provides a direct 
link between the market spread and thinness. Recall that proposition 
4 showed that the probability of a limit order executing decreases, for 
all values of P#8, as a security’s order arrival rate decreases. Thus, 
ceteris paribus, thin issues have a lower probability of limit orders 
executing than do thick issues. This lower probability in turn de- 
creases the proportion of investors choosing limit orders over market 
orders at any given size of the spread; this implies that, for a thinner 
issue, a wider spread would be required for the forces that increase 
and decrease the spread to be in equilibrium. Hence we have: 


Proposition 5: Thinner securities will, ceteris paribus, have larger 
equilibrium market spreads. 


VI. Conclusion 


We have presented a proof of the existence of market spreads in 
markets with many traders. The formulation treats continuous prices 
and allows for heterogeneous trading propensities. The proof is not 
dependent on the demand of traders for immediacy or on the cost to 
market makers of providing immediacy. 

The literature on bid-ask spreads does not appear to have recog- 
nized that aggregation from individual to market spreads is a con- 
siderably more complex process than the standard aggregation from 
individual to market demand and supply functions. Neither has it 
been established that ordinary investors may sometimes seek to trade 
via limit orders (and at other times via market orders), hence that 
these investors will sometimes supply (and at other times demand) 
immediacy, and that in choosing between market and limit orders 
investors implement an order placement strategy. In addressing these 
issues, we have sought to establish the links between transaction costs, 
individual investor order placement strategy, market thinness, and 
market spreads. 

We have first established that, with transaction costs, the probability 
of a limit order executing does not rise to unity as the price at which the 
order is placed gets infinitesimally close to a counterpart market 
quote. We have next shown that the resulting investor trading strate- 
gies generate what we have referred to as a gravitational pull effect. 
Essentially, in the neighborhood of the current market bid and ask 
quotations, what might otherwise have been limit orders are instead 
submitted as market orders (at slightly less desirable prices) so as to 
achieve certainty of execution. These market orders trigger trades 
which clear limit orders off the book, widening the market spread. 
The gravitational pull effect explains why market spreads may be 
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substantial even in markets composed of many traders. Finally, we 
have defined an equilibrium market spread (where the forces that 
tend to widen and to narrow the spread are in balance) and have 
shown it to be positively related to a security’s thinness (measured 
inversely by the order arrival rate). 

Our formulation has several implications for the design of a market 
system. A primary objective of system design should be to expand 
the extent and frequency with which investors interact with the mar- 
ket by minimizing various transaction costs. Decreasing variable 
transaction costs will decrease individual spreads and generate a 
greater order flow; decreasing the costs of monitoring and com- 
municating with the market will also increase the frequency with 
which investors rebalance their portfolios; and consolidating the cur- 
rently fragmented system (by, e.g., instituting a consolidated limit 
order book) will reduce search costs and further shrink spreads by 
increasing the effective thickness of the market. These costs are all a 
function of market structure and hence should be amenable to 
reduction by appropriate system design. However, a major cost of 
interacting with the market is the cost of decision making, and this 
might not be subject to significant reduction by exchange organiza- 
tion. For this reason, it is possible that, especially for thinner issues, 
spreads will remain sizable in a restructured national market system. 


Appendix 


In this Appendix, we restate formally and prove propositions 1—4 of Section 
III above and develop the more technical aspects of the arguments presented 
in Section IV. 


Section IT 

Equation (1) of Section III presents a compound Poisson process as the 
stochastic process which generates a sequence of market ask prices over time. 
This can be used to determine the probability that a limit bid order will 
execute during a particular time period. 

Suppose that the price of the limit order to be submitted is greater than the 
current market bid but less than the current market ask, that is, that P¥8 < 
P¢® < P#4, To determine the probability that the potential limit bid will 
execute in a time interval of length L, we must find the probability that P4(A) 
will decrease to a price equal to or less than P/*.1* Therefore, let Y (L, P“, P#) 
be the probability that the minimum value that P¥4(A) achieves in the interval 
0 < A SL is equal to or less than P/*. As P?* approaches P//4, we would expect 
the probability of the potential limit bid executing to increase, since the 
amount that P//4(A) would have to decline would be reduced. However, in the 


6 For P}4 discretely greater than P/”, the possibility of another investor submitting a 
limit order with a price greater than P7” will only tend to decrease the probability of 
execution of the original limit order. Thus our proof of existence of the probability 
jump at P4 is conservative. 
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limit as P}® approaches P/4 from below, without further analysis it is unclear 
how far the probability will rise. Therefore, let 
(LEP 4) salma ek (Eero )s 


P}S —» PMA 
where the limit is understood to be from below. We now prove: 


Proposition 1: If P¥4(A) is generated by the compound Poisson process of 
equation (1), then ¢(L, P#4) > 0 for all intervals of length L < ©.'7 

Proor: Since Z; is stochastic with mean zero, P (Z; = 0) > 0. Furthermore, 
the value of the Poisson random variable N(L) will be finite in any interval of 
length L less than infinity. Therefore, the probability of N(L) consecutive Z; 
observations that are greater than or equal to zero is given by [P(Z; 2 0)]™, 
which, since N(L) is finite, must be strictly greater than zero. Notice that if all 
the Z; observations are greater than or equal to zero, the value of P¥4(A) must 
be greater than or equal to P#4 throughout the interval 0 < A < L; thus the 
limit order P/# would not have executed. This is sufficient to demonstrate the 
probability jump. Clearly, there are other sample paths that P?/“(A) could 
have followed which also would have failed to execute the limit order. 

PRoposITION 2: If P}4(A) is generated by the Wiener process of equation 
(2), then #(L, P4) = 0 for all intervals of length L < ~. 

Proor: By the reflection principle for a continuous Wiener process (see 
Karlin 1968, pp. 276-77) and since In P#/4(A) is driftless, 

Y(EyP¥4s Prey "Primm Pe4(Ay) = Py Primm in P(A) = Pr 

0<A<L O<A<L 


2 Pr{ln PMA lnPP) = 2 Pr{PH4 =P} 


ee eet 
oV 27L © -@ , 2L o 


where the latter probability distribution follows from the definition of the 
Wiener process. Now 


: D pi 1 x — PMA \2 
lim Y(L, P¥4, PH) = —=— | © exp |- ae (= } lax. 
Pi? —> PMA GN 2 Te 2L oy 


Substituting the variable y = (x — P¥4)/(oVL), the preceding limit equals 
(2/V/2ir) f°. exp {-% y*}dy = 2(%) = 1. r 

Proposition 3: If the random variable Z; is expressed as a Bernoulli 
random variable, with Pr(Z; = a) = Pr(Z,; = —a) = 1/2, and if the arrival rate v 
of the Poisson process N(A) goes to infinity, while simultaneously reducing the 
size of a in such a way that va? remains constant, the compound Poisson 
Buccs approaches the Wiener process described by equation (2) with o? = 
va’. 

Proor: This follows from the theorem that the characteristic function for 
the compound Poisson approaches that of the Wiener process (see Parzen 
19629 p2 99): 

ProposiTIon 4: [OY(L, P”4, P#*)|/ov > 0 for all P/? < P¥4 and L < ~, 

Proor: By the Markov property of the compound Poisson process, in- 
creasing (decreasing) the order arrival rate by some factor ) is identical to 


‘7'This proposition would also hold for other stochastic jump processes where the 
number of price changes is finite in any finite interval. 
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increasing (decreasing) the length of time L between decision points by X. 
When 4 is increased, those sample paths of the process which initially satisfied 


min —P¥4(A) =P 
OSASL 


will still continue to do so. On the other hand, some sample paths where 


min P¥4(A) > PLB 
0<ASL 


will now satisfy the inequality 


min P¥A(A) < PEE, 
OSASAL 
Therefore, Y(L, P#4, P#*) would increase for \ > 1 for all values Olea apa 
oe Y(L, P#4, P?*) would decrease if the order arrival rate \ is de- 
creased. 


Section IV 

We now present the more technical aspects of the model of an investor’s order 
placement strategy discussed in Section IV. The subjectively determined 
probability distributions are specified as follows. First, the probability distri- 
bution of future market bid and ask prices is given by h(P#4, PMB eee pee 
where h is a joint density function for market asks and bids (PM4, P28) at time 
t + 1, given the prices at ¢. Consistent with random walk models of security 
price behavior, we condition future prices only on current prices. Consistent 
with Section III, we let the investor’s subjective probability of a limit order 
executing before t + 1 be given by p(P/}4, P¥4, P¥*), where P/# is the limit bid 
price. (Again, this is consistent with random walk theory, since we need know 
only current market quotes to predict whether or not the limit order will be 
executed.) Last, we assume the investor determines conditional probability 
distributions of future prices in the event of either a successful or unsuccess- 
ful limit order. That is, for a limit bid at price P/® submitted at time ¢, let 
k( PUA, PU | P24, P¥®, P}*) be the joint density function of market bid and ask 
prices at time ¢ + 1 if the limit bid executes prior to ¢ + 1, and let 1(PM4, 
PIE | PMA, P¥®, P?*) be the joint density function at time t + 1 if the limit bid 
fails to execute prior to t + 1. Note that only three of the four subjective 
probability distributions h, p,k, and / can be independently determined, since 
by Bayes’s theorem we must have 


Rig PEP (PPP, PH ko lee Pee Pe) 
ap Pr Pt BYE, 9 PUA PHEN Py 


for all possible values of P#4, P¥?, and P}*. 

We can now give a formal statement of the model. Given observed market 
quotes of P#4 and P#? and holdings by the investor of N, shares of the security 
and S, dollars in cash, at time ¢ the investor’s expected utility of terminal 
wealth can be written as W, = f,(P¥@4, P¥®, N,, S,).18 


** Note that the investor’s underlying utility function measures the utility of the 
wealth he will possess at some horizon T, where T > ¢. The investor will choose those 
decisions at timest, ¢ + 1,...,7 — 1 which maximize the expected utility of his terminal 
wealth. As of time ¢ (the investor’s current decision point), viewing the expectations 
operator as ranging over relevant random variables pertaining to times between ¢ and 
T, we define yp, to be the expected utility of the investor’s terminal wealth. Clearly this 
depends upon the assets the investor has at time t (N, and S,) and the current market 
prices at which the investor can buy or sell shares (P¥4 and P¥). 
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We now derive an expression for y%; in terms of the various probability 
assessments and costs. Write w, = max (U,, Uz, U3), where U; = expected 
terminal utility if a market order is placed at time ¢, Uz = expected terminal 
utility if a limit order is placed at time t, and U; = expected terminal utility of 
doing nothing at time t. More precisely, letting x and y be values for the ask 
and bid prices at time't + 1, respectively, we have 


U, =| [ore Goya’ act AN, S; =e IN Pee ‘ h(x, | P#4, P¥®)dxdy. 
0 0 


As stated in Section IV, we assume that the number of shares to be purchased, 
AN, is fixed. Define the function 


UKP A) = ppt, Pa, PH) | J feats. ah AN, 
S:— C4 — C? — AN - PH) - k(@,9 | Pi4, PY, PZ?) dxdy 


at [1 — pipe, pis, Penn | J Sesilx, y, Ne, S; a Cc) 


oh (ay) Pas teen ete \dxdy 
for all possible prices of the limit order. Then we have 


Up = wren UIE). 
Pe 


Finally, for the do-nothing option we have 


Us -f J foator.ne 50 ‘h(x, y | P¥4, PHP) dcdy. 


The dynamic programming recursion permits us to obtain a solution for /; 
in the order T — 1,T — 2,..., 2, 1. This is possible since we assume the utility 
value for f7 is known for all values of the parameter P}/4, P¥8, Np, and S7. We 
have defined this recursion in terms of the four state variables P¥4, P¥?, N,, 
and S;. We also have two decision variables, P/? and the decision as to which of 
the three courses of action to take. 
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We explore the conditions under which the welfare benefits of local 
public goods projects will be capitalized into land values. We find two 
types of sufficient conditions, one involving similar communities and 
the other involving differentiated communities. The form of 
capitalization differs between these cases, and we explore the nature 
of these differences. We also examine intermediate cases and iden- 
tify models in which there will be no capitalization. 


I. Introduction 


The idea that the value of local public goods projects will be 
capitalized into land rents has a long tradition in the public finance 
literature, dating back at least as far as Tiebout (1956). Empirical 
evidence has been mixed, but some positive results have been re- 
ported by Oates (1969), among others. This simple idea is potentially 
very powerful. It provides a simple measure of project benefits which 
reveals (if perhaps after the fact) individual preferences for public 
goods. And if rent changes can be predicted in advance, it may 
provide an objective function for community decision making. 
Furthermore, the presence of capitalization forces has important 
equity implications. To the extent that capitalization occurs, benefits 
tend to accrue to landlords at the expense of renters. We will show 
that capitalization reflects a real welfare increase only if these two 
groups are considered equally deserving. If, on the other hand, rent- 
ers are considered more deserving, then the presence of capitaliza- 
tion clearly will worsen the distribution of benefits within society. 
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The “capitalization hypothesis” is interesting from a_ purely 
theoretical point of view as well. It is clear that increases in land rent 
values are not themselves a net social benefit; owners of the land will 
benefit, but renters will suffer. Indeed, in the standard surplus theory 
of welfare measurement, changes in land prices (or prices in any 
other competitive market) never appear as net benefits (in aggregate, 
the gains to winners just offset losses to losers). Thus, if capitalization 
occurs, it must happen because the true benefits induce a corre- 
sponding change in rents. 

We will argue that there are two separate and independent forces 
which can induce capitalization, one involving competitive forces be- 
tween communities and one involving such forces within a particular 
community. Furthermore, these two forces have quite different im- 
plications for (1) what benefits are capitalized and (2) where those 
benefits are capitalized. Thus, we will derive several different capitali- 
zation theorems and explore the conditions under which each holds. 
In the process, we will find some conditions under which capitaliza- 
tion does not occur at all or occurs only partially. 

Let us refer to capitalization which derives from forces between 
communities as “external capitalization” and capitalization which de- 
rives from forces within communities as “internal capitalization.” The 
intuitive argument for external capitalization is quite simple. Suppose 
that a project is built in one community designed to make people 
better off there. Now, if people in all communities have similar tastes 
and are free to move among the communities, outsiders will necessar- 
ily be attracted to the project-building community. And they will 
continue to move until the welfare incentive disappears. The only 
factor which can stop this movement is a differential location cost, that 
is, an increase in land rents in the project-building community. This 
type of argument is the one used by Polinsky and Shavell (1976) to 
justify a form of capitalization. However, the Polinsky/Shavell model 
is not very general, since the benefits accrue to absentee landlords. We 
will show below that increases in land values generally will overstate 
true net social benefit, although they may be a good approximation to 
community net benefit. 

Naturally, when we drop the assumption of free mobility between 
communities or the assumption of similar tastes between com- 
munities, the intuition for external capitalization is weaker. Indeed, 
several authors have argued that we should not expect land value 
capitalization in specialized (or Tiebout) communities which are im- 
mune to migration forces due to differences in tastes (see Hamilton 
1975 and Polinsky and Rubinfeld 1978). 

However, there is a quite different argument for internal capitali- 
zation which is based on specific properties of local public goods. 
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Presumably public goods are local in character when proximity mat- 
ters. For example, one has to make trips in order to appreciate such 
public goods as parks, museums, civic centers, sports complexes, and 
even roads and highways. Now, if there is a positive correlation 
between the amounts of public goods and the differential desirability 
of various locations, some degree of capitalization will occur as rents 
adjust. We will show later exactly what conditions are necessary for 
full capitalization. This type of argument has been used in the litera- 
ture to justify the use of rent gradients for measuring the cost of 
pollution near a source (such as an airport or factory).' It is also in 
the spirit of arguments for capitalization given by a number of 
authors, among them Strotz (1968), Lind (1973), and Pines and Weiss 
(1976). These authors essentially start with the assumption that a 
project will differentially improve land quality and explore whether 
or not those improvements will translate into rent increases. The 
results are instructive, but the underlying approach seems to beg the 
question somewhat. The primary question is, Will a local public proj- 
ect change the differential quality of land? For the case of irrigation 
projects discussed by Lind, the answer is clearly yes, but in other cases 
it is less obvious. Indeed, we will show that, for some types of public 
goods, no capitalization occurs at all. 

We will want to pay attention to the system of taxation in discussing 
both types of capitalization. One might guess that the form and 
nature of capitalization will depend on whether or not the public 
goods are financed out of a property tax. We will show that this is 
sometimes, but not always, the case. 

We adopt a somewhat generalized version of the Polinsky/Shavell 
community model in the sequel. This model has a number of special 
features but is chosen because it facilitates the exposition and makes 
for easy comparisons with the preceding literature. However, many of 
the results reported here remain valid in a much more general eco- 
nomic model. The interested reader is referred to Starrett (1978) for 
some general results on capitalization. 


II. A Prototype Model 


Throughout this section we deal with a model in which there is a 
single local public good (q) being produced independently by a num- 
ber of communities. Further, we will suppose that projects initiated in 
a community will not affect private goods prices other than land rents. 


1 For an example of this argument in the literature, see Freeman 1971. 
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This may seem a serious restriction, but it is not, really, since changes 
in other prices will have the usual canceling effects on the two sides of 
the market. With nonland private goods prices fixed, we can aggre- 
gate all these goods into a single commodity and let this commodity be 
the numeraire in what follows. 

Agents must engage in effort (take trips) to enjoy the public good. 
Naturally, such trips are costly; here we will assume that the expense 
can be represented by a simple cost function in terms of the 
numeraire: f(g,s), where g is the number of trips taken and s stands 
for location within the community. Location will be specified by di- 
viding the community’s land into discrete zones and assuming that 
location can adequately be specified by designating the appropriate 
zone. 

Each consumer gets to choose a zone and an amount of land within 
the zone (¢). The market for land will be assumed competitive, and r, 
will stand for the rental rate in zone s. 

Consumers earn income in terms of the numeraire. We will want to 
separate this income into three terms: profit shares (II), rental income 
on land (R), and other income (mainly labor) (Y). Thus, an individual 
in income classi has income J‘ = II’ + Y‘ + Ré. Such an individual will 
be required to pay taxes T'. The T‘ may be a function of other 
parameters, depending on the nature of taxation. 

A consumer of preference type a has a preference function of the 
forma [aie Cl = T* —f(g,s) — r,€). The arguments of this function 
are the level of public goods provision, the number of trips taken, the 
amount of land consumed, and the amount of other private goods 
consumed. It seems appropriate that both g and q should affect utility. 
For example, if the public good is a museum, agents will care about 
the size of the museum as well as the number of visits made. 

Some of the results reported below will depend on the properties of 
U and f. Specific assumptions will be introduced as we go along, but 
the following general discussion may be helpful here. It seems rea- 
sonable that the marginal cost of trips (0f/dg) will increase as the 
distance from the public good increases. We expect to find that use 
decreases with distance if and only if this assumption holds. Also, it is 
reasonable to suppose that the marginal benefit from public goods 
(9U/dq) increases with use (g). This marginal benefit will decrease 
with distance whenever both assumptions hold. 

Each consumer ina particular community gets to choose g, ¢, ands. 
(At a later stage such a consumer may be able to choose a community 
of residence as well.) We will assume throughout that there is free 
mobility within the community, so that s is a free choice. It is ana- 
lytically convenient to think of each resident as making a conditional 
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choice of g and @ for each s. This process generates a first-stage 
optimization problem: 


max U%lq,e,6.1 —1 —=3(@,s)\— ret |. (1) 
oe 


A solution to problem (1) defines demand functions gs! = g°(q,/ lone 
Ti,r,,s) and €¢ = €%(q,I' — T',r,,s) and an indirect utility function V#* = 
VG tel ea L.s)s Ata) later “stage,” the consumer makes a discrete 
location choice, seeking to max, V¢‘. We will use the notation V% to 
denote the final indirect utility function. 

The remainder of the community economy consists of firms, about 
which we need to say little except that they behave competitively and 
are not affected by the public goods, and the government, which 
produces the public good from private goods, paying for those goods 
with revenues from the taxes levied on households. Letting I'(q) stand 
for the public goods cost function, the government must satisfy a 
balanced budget condition T = I'(q), where T stands for total tax 
revenue. 

We assume that the private sector is always in market equilibrium. 
For nonland commodities, this simply means that markets clear at 
fixed (unchanging) prices. For land, it means that rental rates must 
adjust until the market for land clears in each zone. 

We will formulate questions about capitalization as follows. We 
measure welfare using a standard Bergson-Samuelson formulation. 
We ignore distributional considerations by assuming that the distri- 
bution of income is initially optimal (such an assumption is clearly 
necessary for pure forms of capitalization). Next, we consider an 
initial situation and a proposed new project. Some form of capitaliza- 
tion occurs when there is a correlation between the resulting welfare 
change and the associated rental change. We will see that this correla- 
tion can take many forms. 

All precise statements about internal capitalization require some 
boundary condition on community rents. This boundary condition 
will take different forms, depending on the structure of the town and 
the organization of taxation in the town. We will discuss two different 
types of communities in this regard. In the first type of community 
the boundaries will be considered potentially variable, with “outside” 
land always available to the community at an exogenously given op- 
portunity cost; the boundary rent is thus exogenously given in such a 
community. The real prototype for such a community would be a 
town or metropolitan area located in the middle of farmland. We will 
refer to such communities as isolated. 

In the second type of community, the boundaries are predeter- 
mined (presumably by legal arrangement). The effect of a project on 
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boundary rent will be determined by changes (if any) in the aggregate 
demand for land in the community. The real prototype here would be 
a subcommunity within a larger metropolitan area. We will refer to 
such communities as adjacent. 

The local public goods model being proposed here seems most 
applicable to the case of isolated communities, since it is difficult to 
ignore the importance of direct spillovers among adjacent com- 
munities. However, it still seems useful to treat the adjacent case. 

To demonstrate that there are separate forces generating external 
and internal capitalization, we will need to show that it is possible to 
have each without the other. We will begin with a discussion in which 
we assume that no external forces are present; that is, we assume that 
community policy will not lead to any net migration to or from the 
community. Within this context, we explore the conditions for inter- 
nal capitalization. We derive alternative sets of assumptions under 
which there is no capitalization or full capitalization. 

Then we will return to the issue of external forces. We will give 
these forces their best chance to work by assuming free mobility 
between communities. To isolate the effect of these forces, we will 
adopt a model of the community which is consistent with no internal 
capitalization. Within this context, we again exhibit conditions under 
which there is either no capitalization or full capitalization. 


A. Internal Capitalization 


Assuming away migration effects, there are essentially two ways in 
which project benefits can get translated into increased land values: 
through a change in the extensive margin or the intensive margin. Ifa 
project increases the aggregate demand for land, it may lead to an 
increase in the general level of rents (by operating on the extensive 
margin for land). Intuitively, this force could go either way. When 
taxes are raised (to pay for a new project), there is likely to be a 
substitution away from goods (including land). On the other hand, 
when public goods provision is increased, there is likely to be a 
complementary increase in the demand for land. Since the net effect 
on rents is clearly ambiguous, there is no reason to expect systematic 
capitalization from impact on the extensive margin. In fact, we will 
argue that the extensive margin.is likely to be of no importance, 
especially in isolated communities. 

However, the project may affect the differential value of locations 
and thereby influence the rent structure through the internal margin. 
We will show that this force may lead to systematic capitalization in 
relatively large communities, within which there is a wide range of 
choice regarding use of the public goods. Restricting this range of 
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choice can lead to any intermediate case from full capitalization to no 
capitalization. 


B. Sufficient Conditions for No Internal Capitalization 


The essential restriction which is needed to rule out internal capitali- 
zation is that there be no effective differential choice in the use of 
public goods within the community. A number of sets of assumptions 
will do for this purpose. 

The easiest case to explain is one in which there is no differential 
choice within the community at all. By this, we mean that g = g and ¢ 
= @ for all agents in all zones; these conditions would hold if the plot 
size is institutionally given and the public good has the characteristics 
of national defense so that residents get the benefits regardless of 
their actions. 

A resident of preference type a and income class 7 now chooses s 
(the only remaining choice variable) to max, U“[q,g,¢,/' — T' — f(g,s) 
— r,€]. Clearly, this problem reduces for every agent to one of 
minimizing costs: min, [f(g,5) — r,€]. Since the costs to be minimized 
are the same for all agents, the only possible equilibrium rent struc- 
ture must be one which makes those costs independent of s. (At sucha 
cost structure, everyone is obviously indifferent as to where they live.) 
Now, this independence condition must be true both before and after 
the project is initiated. It follows that the derivative of costs with 
respect to q must be independent of s. But with g and @ fixed, this 
means that (d/dq)r, must be independent of s; that is, if rents change at 
all they must change in a uniform way. However, rents at the bound- 
ary cannot change: There is no change in the aggregate demand for 
community land, so the boundary rent will not change in either 
isolated or adjacent communities. Hence there is no capitalization. 
Thus we can identify one case which formalizes the assertions of 
Hamilton (1975). 

We can introduce further choice into the community and obtain the 
same results as long as we are willing to restrict the degree of diversity 
and the degree of complementarity of tastes. For example, suppose 
that tastes are separable and similar in that preferences of all agents 
can be represented in the form U = U'(q,g) + 0(@) + Ti — Ti - 
S(g.s) — rs@, where 7 indexes incomes class, as before. Now, if it should 
turn out that the optimal choice of g is independent of s (so that there 
is no effective differential choice within the town), we can again argue 
that there will be no capitalization. Clearly agents still make the same 
choices regardless of income class. Thus, rents must still adjust so that 
everyone is indifferent as to where they live (Vi is independent of s). 
This condition must be true both before and after the project, so 
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(d/dq)V; must be independent of s. Differentiating and using the 
envelope theorem, this condition becomes that 


dr, 


a. 
= U(g,2;) + &, 
dq V (9-8) a (2) 





is independent of s. But since we assumed that g; was independent of 
s, the first term of (2) is independent of s, and we are back to the 
statement that the rent structure must change uniformly if at all. In 
fact, it cannot change at all, because there is no change in the exten- 
sive demand for land (given separable preferences). 


C. Sufficient Conditions for “Full” Internal Capitalization 


Internal capitalization can take a number of different forms, de- 
pending on specific assumptions made concerning the boundary con- 
ditions, the type of taxes imposed, and the nature of property 
ownership. We will delineate these cases as we go along. As will be 
seen, all of the propositions reported here rely exclusively on the 
intensive land margin to “enforce” capitalization. There may be some 
very special functional forms which will generate systematic capitali- 
zation from the extensive margin, but these would seem too special to 
be of much interest. 

There are two principal assumptions which taken together 
guarantee that project benefits will be translated to land rents through 
the intensive margin. The first is that all the benefits of the project 
must be “intramarginal” in that boundary residents are marginally 
unaffected. We would expect this condition to hold if marginal resi- 
dents choose g = 0 (so that there is a complete revealed range of 
choice within the community). But the condition they may hold more 
generally; for example, in the case of a museum or park, “boundary” 
residents may make so few trips that, at their current level of activity, 
they could make no better use of a larger facility. We formalize this 
condition as follows. Let o stand for some boundary region. Then 
“marginal indifference” will mean that 

aCe ee ee ~ 7, €4] = 0. (3) 
aq Ulq,80,€5,1' —T' —f(g.0) — ro€6 

The second necessary assumption is that residents do not tend to 
sort themselves out within the town according to their relative prefer- 


2 This case is closest in spirit to the example of no capitalization given in Polinsky and 
Shavell (1976). It also corresponds to Hamilton’s (1975) model of Tiebout communities 
in which there is no internal spatial structure. As we show below, Hamilton’s conclusion 
that there would be no capitalization in Tiebout communities depends critically on the 
assumption of no internal spatial structure. 
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ence for the public good. Since systematic differences will generally 
lead to sorting, this means that the community must be reasonably 
homogeneous in attitudes toward public goods. We will see later that, 
when sorting occurs, it will tend to mitigate capitalization. 

The required homogeneity will result whenever agents are addi- 
tively separable in their preferences on any component of consump- 
tion concerning which they systematically differ. Here we will treat 
the case where individuals differ in incomes and have preferences 
which are additively separable in the associated numeraire consump- 
tion good. 

Consequently, we now assume that a resident (7) of the community 
has preferences which can be represented in the form U‘ = U(q,g,€) + 
I' — Ti — f(g,s) — rs€. Now, it is obvious that residents will not sort 
themselves out by income class. Conditional on living ats, all residents 
would make the same choices of g and € (gi = gs, €5 = @s, alls), so at 
any specified rent structure, they would all agree on the best location; 
hence, as before, the rent structure must adjust until all locations are 
equally desirable for all residents. 

We now evaluate the first-order welfare effect of a new project: 


dw = Lo,dV'. (4) 


Since we are ignoring distributional factors, we must assume that 
whatever income differences prevail are “optimal,” implying that the 
welfare weights (w;) are equal (otherwise, pure transfers could be 
made in such a way as to improve net welfare). We normalize units by 
setting the common weight equal to one. In evaluating (4) we treat 
everyone as if they were living at the boundary region o and take into 
account all potential effects which gq could have on the parameters 
faced by consumers. 
Performing the differentiation, applying the envelope theorem 

where appropriate, and aggregating where possible, we have 

dw 0 dra mtn dy eu dilas indie ag4t) 

de Tope ell Nao angered ere 
where N stands for the total population of the community and the 
income variables without superscripts stand for aggregates over all 
residents. Now, marginal indifference (3) means that the first term is 
zero, and dY/dq must be zero as well, since Y cannot change under the 
assumption that nonland private goods prices are fixed. Furthermore, 
government budget balance implies that dT/dq = dl'/dq. Making these 
substitutions, we have, finally, 


qWw_ dR _ dV, ail dre 
dq dq dq dq 


Noo Ge (5) 
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The formula (5) can be thought of as a generic capitalization result 
from which a variety of specific relationships can be derived. 


1. Local Ownership 


Let us first look at the case where local firms (as well as local land) are 
all owned by local residents. In that case II should be thought of as 
total profits generated in the region. Assuming that local firms behave 
competitively, we know that dII can be written in the form dII = X - 5p, 
where X is the vector of net outputs and p the private goods price 
vector. But the only prices which change are land prices, so we have 
d\I/dq = —(dR;/dq), where R; is the value of land used by firms. 
Substituting this relationship into (5), we have 


AWE ngdR teind ister psd 5 


dq. dq dq o dq’ (6) 





where R,. is the value of residential land. 

Alternatively, we could take the view that firms exhibit constant 
returns to scale and are constantly in long-run equilibrium. In this 
case, the above analysis is invalid since any infinitesimal change in 
prices will lead to a discrete jump in the firm’s decisions; in particular, 
if rents go up, ceteris paribus, all firms go out of business. The correct 
analysis would then set dII = 0 in (5) and conclude (as in most of the 
previous examples in the literature) that capitalization involves all 
land value. Upon reflection, it seems that the long run is too long to be 
relevant here. Firms are not free to leave when rents rise, and the 
increasing rent payments by firms should be counted against profits. 
Hence we will stand by the formulation (6). 

The final form capitalization takes depends on the boundary con- 
ditions. In the isolated case, it seems reasonable to suppose that dr,/dq 
= 0 even though the boundary of the town may shift. Conceptually, 
we can think of there being many boundary regions which are shared 
by residents and farmers. As long as some of these regions are still 
shared after the project is initiated, boundary rent cannot change. 
Thus for the case of isolated communities, we assert dW/dq = (dR,/dq) 
— (dT/dq). Verbally, the gross benefits of the project are capitalized 
into residential land values. Net benefits (dW/dq) are gross benefits 
minus costs (dI'/dq). 

A more general statement can be made which will apply to adjacent 
communities as well. If the aggregate demand for land increases so 
that dr,/dq > 0, then the value of residential land overcapitalizes gross 
benefits ([dR,/dq] — N€,[dr,/dq] exactly capitalizes gross benefits) 
while if dr,/dq < 0, then the value of residential land undercapitalizes 
gross benefits. 
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2. National Ownership 


The majority of firms are not locally owned. Indeed, most firms are 
national in the scope of their operations, and local ownership has no 
meaning for such firms. Suppose we take the position that all 
ownership is in national firms. What happens to the analysis? Ac- 
tually, not very much. Indeed, the formula (6) is still correct for 
society as a whole, but now some of the costs of the project are paid by 
“foreigners” who own shares in the firms which are operating locally. 
Indeed, if we take the position that each community is small relative to 
the country, then it is a good approximation to assume that local 
branches of national firms are owned entirely by outsiders. In that 
case, it follows that gross benefits to residents of the community are 
capitalized into total land value, while gross benefits to society as a 
whole are capitalized into residential land values. The difference 
represents an externality. We will discuss other examples of such 
externalities later. 


3. Property Taxes 


Up until now we have dealt with only the case in which taxes were 
treated as lump-sum taxes by consumers. Since the major locally 
imposed tax is a property tax, it is important to see how the analysis 
needs to be modified in that case. If an ad valorem tax at rate t¢ is 
imposed, then an agent using an amount of land @ in zone s will pay 
taxes tr,€, where r, must now be thought of as the rent net-of-tax 
payments. 

Now, the problem for a typical individual can be stated as maxy,¢,; 
[U (q,g,€) +I'—f(g,s) —rs(1 + t)€]. Clearly, it is still true that all agents 
will make the same choices, and, indeed, the method of analysis is 
exactly as before. Performing the calculations (and taking the point of 
view of society in order to avoid any ambiguity concerning the 
ownership of firms), we derive 


dw dR dt 
—— =—— - Nr fo 
dq dq dq 


where R,. now stands for total residential rent net of taxes. 

The exact form of capitalization again depends on boundary con- 
ditions. And now there is some ambiguity concerning these even in 
the case of isolated communities. The issue revolves around whether 
farmers in the boundary regions do or do not pay the taxes. If they do 
pay the taxes, then the free boundary condition is as before. However, 
if they do not pay the taxes (but would have to if they acquired 





dr 
—N€,(1 + t) 4, 7 
(+07 (7) 
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property “within” the community), then the appropriate condition is 
s(1 + t) = r, where r is the opportunity cost of land to a farmer. We 
will refer to this arrangement as agricultural zoning and to the other 
case as blind zoning. 

The case of agricultural zoning is easiest to analyze. Differentiating 
the boundary condition with respect to g, we have (1 + t)(dr,/dq) + 
r,(dt/dq) = 0. Therefore, referring back to (7), 


dW _ dR, 
dq dq” 


Here net project benefits are capitalized into net-of-tax residential 
property values, while gross project benefits are capitalized into 
before-tax property values. 

For the case of blind zoning, the term involving dr,/dq vanishes in 
(7), and we are left to evaluate Nr,€,(dt/dq). Suppose that only residen- 
tial land is taxed, so that the government budget balance condition is 
iR, = I. Differentiating this equation with respect to g and substitut- 
ing for dt/dq in (7) yields 


dq dq dq dq 


where a = Nr€,/R,. The net benefit is now a weighted difference 
between increases in after-tax residential property values and in- 
creases in cost. Note that the smaller is a, the closer we approximate 
the results for the previous case. The a@ stands for the ratio of the 
value of land residents would use if they all lived at the boundary to 
the actual value of residential land. 

More generally, however, (9) tells us that increases in residential 
property value overcapitalize net benefits by an amount related pos- 
itively to the gap between extra costs and the extra revenue that would 
be generated without changing tax rates. Clearly, this result is inter- 
mediate between pure gross capitalization and pure net capitalization. 





(8) 


dR, dv 
—t*-a 








)= (+ at) 


D. Preference Differences within the Community 


Once systematic preference differences are introduced into the com- 
munities, the exact capitalization results tend to break down. Here, we 
demonstrate this and indicate the type of modification which is re- 
quired. For this purpose, let us consider the simplest extension possi- 
ble, one with two different types of residents in the town (indexed a 
and b). Let there be N, (N,) a-type (b-type) residents. It is convenient 
to assume that all a-type residents have the same preferences and 
income (although different incomes could be incorporated as before). 
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Also, we find it necessary to suppose that the plot size within any given 
zone must be uniform, and we may as well assume that this plot size is 
predetermined. 

Now we let S“ stand for the set of zones occupied by a-type residents 
and S° for the corresponding set of zones for b-type residents. Obvi- 
ously if o is included in both sets, the analysis is exactly as in previous 
sections; indeed, all previous theorems go through with preference 
differences as long as those differences do not lead to systematic 
sorting. Therefore, without loss of generality, we can assume that the 
zoning arrangements are as in the Venn diagram shown in figure 1. 

Since the zoning is discrete, there is little loss of generality in 
assuming that at least one zone is occupied by both types. We let 7 
stand for such a zone. 

In what follows, it is convenient to use the shorthand notation F¥(q) 
es U*G.c2t.) — f(e7,s), alls, x = a. Thus, V?@ can be written in the 
form V¢ = F%(q) — 7,.€, + I*. Next we evaluate welfare as before, 
except that now we cannot evaluate everyone’s welfare at o; instead, 
we evaluate a-type welfare at o and b-type welfare at r: 


W=N,Vi+N,V2, (10) 
=iN Hoot Np Fo tela INer, CgiaeN otal 
Finally, we make use of the fact that a-type agents are indifferent 
between locations 7 and a. This indifference implies that 
1,€, =1ols + Ftq) — Fog). (11) 
Substituting (11) into (10) yields W = NF%q) + NoLF2q@) — F2@)1 +7 - 
Nr,€,, and differentiating with respect to g, we obtain: 


dw dU? diotend lew dRaedh 
—— = N ——(q,¢2,€,) - N€@, —2 + —+ — - 

dq dq Seca) dq dq dq _ dq (3) 
dU* ee 

is No | dq q,g2.€,) az dq_ (q.gr | 





Examining (12), we see that the terms are just as before except for 
the intramarginal benefit term: 


dU dU 
N | q, Oe 7, aN, wee } 
ada q,87,€;) dq V8") (13) 





We will argue that there is a strong presumption that this term is 
positive. To begin with, the condition g? > g? is a stability condition for 
the town structure we have specified. It guarantees that there is a 
differential benefit to an a-type agent relative to a b-type agent as one 
moves toward the boundary of the town (b-type agents gain relatively 
more from proximity to the center since they make more trips). 
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Therefore, as long as the marginal benefit from more public goods 
increases with the number of trips, the intramarginal term (13) must 
be positive. 

Thus, when systematic preference differences are introduced into 
any particular variant of the model studied above, land rents will tend 
to undercapitalize the associated benefit. Operationally, this means 
that if land rents were used as a decision criterion, we might end up 
rejecting projects which ought to be accepted. 


E. Sufficient Conditions for No External Capitalization 


Having explored the conditions for internal capitalization in a com- 
munity that was insulated from immigration forces, we turn now to a 
study of those external forces. We will show first that any systematic 
differences between towns can serve to neutralize the external forces 
at least with respect to their effect on small projects. There is an 
important caveat here, however. What we show is that communities 
could arrange things so as to insulate themselves; we cannot show that 
they have an incentive to follow such policies. Indeed, we have shown 
in some related work that communities do not have a myopic incen- 
tive to insulate in many instances.’ We will ignore this issue here. 
We employ the simplest model of community here, in which there 
are no choices with regard to public goods use. The reader might 
usefully recall that this is one of the cases in which there is no internal 
capitalization. All agents within the town are assumed alike. To make 
the point about potential insulation m the strongest possible way, we 
will also assume that preferences of all individuals in all towns are 
alike and that towns differ only in income levels; that difference alone 


3 See Starrett 1977. 
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is enough to insulate towns. Actually, this case may not be so unrealis- 
tic, since sorting by income level is the clearest observed Tiebout 
differentiation among communities. 

Referring back to the first model in Section ILA, we can specify the 
characteristics of community k completely by a level of public goods 
(q*), an income level of residents (£)4-and a costlevel (* — 7 * = 
f'lz,s] +r*@,) which must, at equilibrium, be independent of location 
(s). It is convenient to assume that the g and € levels are the same 
everywhere so that they can be deleted from the analysis, although 
this is not really necessary. Note that, in the model as now constituted, 
there is no economic distinction between property taxation and 
lump-sum taxation. Now, the welfare (UF*) of an individual with 
income /* living in his own community can be expressed as U** = 
U (gr Ihe ht): 

We now ask the following question. Suppose that a set of q’s, I’s, 
and h’s is specified. Will anyone have an incentive to move immedi- 
ately, and if not would anyone see such an incentive if one of the q's 
were changed marginally? We will show that an appropriate initial 
choice will make the answer to both these questions “no” as long as 
income effects are not negligible. Suppose ak agent considers moving 
to j. We assume that, if he does this, he must pay the taxes (and other 
costs) appropriate to j. (If, instead, he expects to pay the taxes appro- 
priate to k, as he surely would if the tax were a property tax, the 
results are still the same, but we would have to incorporate the taxes in 
the J term rather than the f term.) On the other hand, an agent’s 
income is determined by his place of initial ownership. This must be 
true of property income (as long as there are no unanticipated capital 
gains), and labor income is the same everywhere, by assumption. 
Hence, he would expect to get utility U” = U(q’,I* — h’). 

The move is not desirable, and would not be desirable for any 
sufficiently small variation in q’, as long as U** > U™. Similarly, a move 
from tok is not desirable as long as U* = Ug", — h*) <U@,P — hi) 
=U". Are these two conditions consistent? Let us order; and k in such 
a way that q* > q’. Then stability certainly requires h* > h’. To see what 
else is required, it is convenient to change variables slightly. If we 
define Ag = qi — q®,z* =I* — h*, 25 =) — h®, and Az = h* — h/, then the 
two stability conditions may be written as U(g*,z*) > U(q* + Agq,z* + 
Az) and U(qg*,z’) < Uq* + Aq,z + Az), with Aq < 0 and Az > 0. These 
conditions are consistent as long as /* > I, as can be seen in figure 2. 
Of course, we have assumed in the construction that the income effect 
is normal and significant, so that increases in income increase the 
marginal rate of substitution of z for g. Clearly, community k could 


4 The J* is clearly the average income in the community. 
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engage in any project of size less than or equal to 59" without inducing 
entry, while a similar statement holds for community j. 

Naturally, if projects undertaken are too large, entry may be in- 
duced. However, we assert that any systematic differences between 
towns (direct preference differences would have done as well) can 
imply no external capitalization of marginal projects. This same 
threshold effect would result if the segregation of communities were 
caused by a fixed cost of moving (rather than by preference differ- 
Ences).2 

It is interesting to note that if we now reintroduce the possibility of 
internal capitalization, its presence may be enough to neutralize ex- 
ternal forces, even if there are no intrinsic differences at all among 
people. That is, if g* > q’ implies R* > R’, we will automatically have /* 
> fi, assuming that there are no other income differences. 

We chose income differences to illustrate the possibility of insula- 
tion because they were the most innocuous-sounding differences we 
could think of. Obviously, preference differences are more likely to 
serve this purpose.® Indeed, some might argue that income effects are 
unimportant in the context discussed here. Suppose that we reexam- 
ine our problem in the absence of income effects. Interestingly, the 
whole character of the analysis changes. 


° Of course, if there are no costs of moving and a continuum of preference types, any 
project will induce some entry and, presumably, some capitalization. 

§ Since the Tiebout model is characterized by preference difference among com- 
munities, there will be no external capitalization of small projects in Tiebout com- 
munities that are “sufficiently” differentiated. 
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F. Sufficient Conditions for Full External Capitalization 


Without income effects or preference differences between com- 
munities it is impossible to insulate, and the resulting migration forces 
will generate external capitalization. To show this, let us reintroduce 
preference differences within communities while at the same time 
eliminating any income effects or preference differences between 
communities. An a-type resident of income classi now has preference 
of the form V% = U(q) +I‘ —h. At this point, it is convenient to define 
a fictitious “average” resident of community k. This average resident 
clearly would have preference of the form V* = U*(q*) + 1* — hk, 
where /* is the average income level in k (as before) and U* is the 
average felicity function in k. Our assumption of no preference dif- 
ferences between communities will be taken to mean that the average 
felicity functions are the same in all communities, that is, U¥(q) = 
U (q), all k. 

Now let us reconsider the stability conditions for equilibrium. 
Clearly, a necessary condition for stability is that our fictitious average 
resident does not want to move in either direction between any pair of 
communities. The corresponding pair of inequalities for communities 
k andj are 


O(gt) + Tk — hk > O@) + 1k — bi (14) 
and 
Ut) +B — hk < Og) + Bhi. (15) 


Clearly strict inequality in either direction is impossible, and both 
relations must hold as equalities in any equilibrium situation; the 
average resident must be indifferent at the margin as to where to live. 
It follows that any welfare-improving project anywhere must induce 
entry. 

We can now use equations (14) and (15) to generate a simple 
external capitalization result. These equations must hold both before 
and after a project (in some particular town z), so we can differentiate 
them with respect to g?. In doing so we must allow for the possibility 
that a change in q* could affect J’ and h/ (j # z). We have already seen 
one way in which this could happen through external ownership of 
local firms; now that migration is certain, there are the so-called fiscal 
externalities as well; when one community attracts residents from 
other communities, the lost tax revenue is a real external cost to the 
“losing” communities. For a rigorous demonstration of this fact, the 
reader is referred to Starrett (1980). 

Allowing for potential changes in all variables, we can write 


dV = dU (q?) + di? — dh? = di* — dh’ (16) 
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and 
dV} = dU (q?) + di! — dh? = dl — dh’, (17) 


where dV' is shorthand notation for the change in average welfare of 
community 7. 
Let us define dW* to be the total external effect; that is, 


dW* = SNidVi = dI*# — ¥ Nidhi? 


j j#s 


And dW? = N?dV* will stand for the welfare change in community z. 
Clearly,dW = dW# + dw?. Simple substitutions yield the following two 
equations for the change in community z welfare and the change in 
social welfare: 


N* N? 
dw? = we WF a N# (dR# Si dII*) ia dR? ta all? 
and 
N N? 
dw = we a? — NF (dR* ota dII*) + dR? aia dil?. 


Finally, we must distinguish cases again, according to the nature of 
firm ownership. 


1. Neutral Ownership 


Suppose that each community owns its share of national profits. Then 
N?II*# — N#TI# = 0, and we have 


N2 N? 
and 
N N 


Clearly, total rent change in z is a good approximation to net welfare 
benefits in z as long as community z is small relative to the nation (so 
that N*/N*# is small). However, net social benefits will be misrepre- 
sented by rent increases to the extent that the external effect is 


significant. 
How important is this misrepresentation? It turns out it may be 


’ The J‘ stands for total income in community 7; the symbol # means the associated 
variable is aggregated over all communities except z. Recall that we are assuming that all 
local public goods projects are paid for locally. Naturally we could correct for the 
externalities if we could impose the optimal revenue-sharing scheme. 
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quite important even if N*/N* is small. We have studied the measure- 
ment of dW*# elsewhere (see Starrett 1980) and merely report the main 
results here. 

When community z engages in a project, that project is certain to 
induce some entry. Other communities suffer real welfare losses 
which are equal to lost tax revenue (from the migrators) minus any 
attendant improvement in congestion “costs.” Unless communities 
have already seriously overexpanded, the net benefit to outsiders is 
negative (dW* < 0). Hence even changes in residential land values will 
overstate net social benefit, quite generally. 

Of course, if community z is not small as a fraction of the country, 
the theory takes a somewhat more complicated general form. Aside 
from the external terms, net benefits are capitalized into the differ- 
ence between the change in land value in z and the “average” change 
in land values elsewhere. This formulation is reminiscent of that 
presented in Strotz (1968) for two regions. 

There is one other major difference from the internal capitalization 
results: net benefits are externally capitalized, while gross benefits 
were internally capitalized. The economic reason for this discrepancy 
is not difficult to find. The marginal potential resident pays local taxes 
in the internal model (he is the boundary resident), while the marginal 
potential resident does not pay local taxes in the external model. 


2. Local Ownership 


If each community owns its own firms, then dII* = —dR%, and we have 
e— N* z ING 
and 
N _ N? 
dW = —, dW* + dRi — ~dRf. (21) 


The form of the results is the same except that capitalization is into 
residential rather than total land value. The reader might think at 
first that there is a contradiction between equations (19) and (21). 
After all, it cannot matter from a social point of view where the profits 
are owned, so why do these equations look different? The answer is 
that any increased cost of local industrial land will be reflected in the 
term dW* in equation (19) but not in (21). Thus, even in the case of 
national ownership, only increases in residential land values represent 
real social benefit, once adverse land value externalities are canceled 
out. 
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III. Summary 


We have shown in the previous sections that there is considerable 
diversity in the form and extent of capitalization, depending on the 
nature of the assumptions made. However, some fairly general prin- 
ciples emerge, and we summarize these in this section. 

First, all of the models studied lead to the conclusion that, from the 
point of view of society as a whole, it is residential property values that 
capitalize project benefits (rather than total land value). The theories 
differ on the appropriate land base only from the perspective of the 
project-generating community; if lost profits from increased land 
values are exported, then increases in total land value measure 
benefits to the community, but, if not, then it is residential land values. 

The internal and external capitalization models generally disagree 
on whether gross or net benefits are capitalized. However, it is worth 
pointing out one important case in which the two theories agree on 
this issue. This is the case of property taxation with agricultural 
zoning, for which both theories suggest that increases in net-of-tax 
land values capitalize net welfare benefits (and, naturally, increases in 
gross-of-tax land values capitalize gross benefits). 

The main differences between internal and external capitalization 
results can be traced to the boundary conditions imposed on a repre- 
sentative community. Notice that such boundary conditions play no 
role at all in external capitalization; indeed, the external results hold 
regardless of what happens to rents or benefits at the boundary of the 
communities. 

However, internal capitalization results vary considerably, de- 
pending on the assumptions governing behavior of land rent and the 
impact of incremental projects, at the boundary. In the extreme case 
in which the impact is nil and the (net-of-tax) rent goes down by the 
rate of taxation, internal capitalization agrees with external capitali- 
zation. But if boundary rent is unchanged (as it will be under many 
plausible conditions) the gross project benefits are internally 
capitalized. And if marginal projects actually make boundary resi- 
dents worse off (because of a dominating congestion effect) then land 
values will overcapitalize even gross benefits. 

We should recognize that, in all cases considered, land values will 
fail to capture some of the external benefits (or, more likely, costs) of a 
project. When the only external effect is through nationally owned 
profits, as was the case in the pure internal capitalization model, we 
could correct for the externality by using residential land value rather 
than total land value. However, in the free-trade, free-migration 
model of external capitalization, the external costs of a project are 
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generally much larger, and we conclude that increases in residential 
land value will overstate true net social benefit by an amount closely 
related to the size of the fiscal externality. 

We close with some remarks on the basic assumptions underlying 
capitalization results. The two types of capitalization rest on different, 
and to some extent complementary, assumptions. External capitaliza- 
tion requires that the distribution of agents by economic characteris- 
tics be the same in one community as in all others, while internal 
capitalization requires that agents within a particular community must 
have similar economic characteristics (though these characteristics 
may differ from community to community). Thus, the internal 
capitalization results are most applicable in a “pure Tiebout” world 
where each community specializes and attracts a single type of agent. 
The external capitalization model is most applicable in the opposite- 
extreme world of “pure scrambling” in which all communities look 
alike in terms of the characteristics of agents. Naturally, there are 
many intermediate cases between these extremes (including the real 
world, presumably!). To the extent that neither extreme is a good 
approximation, capitalization will not hold in either form, or will hold 
only partially. 
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Money and the Dispersion of Relative Prices 





Zvi Hercowitz 
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A price dispersion equation is tested with data from the German 
hyperinflation. The equation is derived from a version of Lucas’s 
and Barro’s partial information-localized market models. In this 
extension, different excess demand elasticities across commodities 
imply a testable dispersion equation, in which the explanatory vari- 
able is the magnitude of the unperceived money growth. In order to 
test this hypothesis a price dispersion series is constructed, and a 
measure of the unperceived part of money growth is estimated. The 
model receives support from the empirical analysis, although it is 
evident that unincluded variables have important effects on price 
dispersion. 


The existence of a positive correlation between absolute price level 
variability and the dispersion of relative prices was observed by, for 
example, Mills (1927), Graham (1930), and recently by Vining and 
Elwertowski (1976). In his study of U.S. price behavior during the 
period 1920-26, Mills (1927) says: “We have not however exhausted 
the possibility of discovering a relationship between price level and 
dispersion. It may be that dispersion depends upon the violence of 
the price change, regardless of direction” (p. 284). Graham (1930) 
finds in the post-World War I German hyperinflation an additional 
dynamic element of price behavior: “It is clear that with the initiation 
of an upward movement in general prices a series of lags in individual 
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prices developed, that these lags tended quickly to disappear when 
stability of general prices was reached on a new level, or when general 
prices fell, but that they were nevertheless progressively eliminated 
even though the general price level continued to rise” (p21 75)T his 
observation suggests that unexpected events may have an important 
role in the determination of price dispersion. Individual prices dis- 
perse at the beginning of an upward swing in the price level when the 
acceleration is presumably unexpected. As inflation continues the 
element of surprise wanes, and prices tend to converge. 

The studies cited above failed to offer an economic rationale for the 
observed statistical correlation. Recently, a theoretical explanation of 
the relationship between price level variance and relative price disper- 
sion was offered by Barro (1976). Using a localized markets 
framework of the type described by Phelps (1970) and employed by 
Lucas (1973), Barro links the dispersion of relative prices to the 
variance of the money supply. The key elements of this model are, on 
the one hand, individuals possessing incomplete current information 
and, on the other, demand and supply in each market reacting to 
relative prices as they are locally perceived. Thus, agents are con- 
fronted with the problem of determining whether locally observed price 
movements are caused by general inflation or by shifts in relative 
excess demand. The larger the variance of the money supply, the 
more likely are agents to attribute local price movements to general 
inflation rather than to relative shifts. Accordingly, as the money 
variance rises, local price changes induce smaller supply-and-demand 
responses—that is, excess demand becomes less elastic. Consequently, 
stochastic shifts to local excess demand produce larger changes in 
individual prices, so that the dispersion of prices across markets tends 
to increase with the variance of money. In this specification of the 
model, in which all markets have the same structure, dispersion is 
unrelated to the magnitude of realized money shocks. 

This paper modifies Barro’s framework by interpreting each loca- 
tion to be the market of a specific commodity, characterized by a 
particular excess demand elasticity. Because elasticities vary across 
markets, aggregate shocks affect each commodity price differently. 
Therefore, in this modified setup price dispersion is positively related 
to the magnitude of these shocks. 

The model also predicts that systematic or perceived money growth 
is neutral with respect to price relationships. Accordingly, a money 
shock in this model is defined to be the component of money growth 
that is currently unobservable and cannot be inferred from currently 
available information. Whereas the quotation from Graham sug- 
gested that sudden—presumably unexpected—shifts in money 
growth cause dispersion, in this model unexpected monetary expan- 
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sion disperses prices only if it is, at least partially, currently unper- 
ceived. 

The main task of this study is to evaluate this hypothesis with data 
from the German hyperinflation, a period of predominantly mone- 
tary disturbances. The period considered runs from January 1921 to 
July 1923. The vertiginous monetary expansion initiated in August 
1923 differentiates the last phase of the hyperinflation, and, thus, it is 
not included in the sample. 

The theoretical framework, presented in Section I, neglects some 
important facets of the hyperinflation. Specifically, it ignores the 
foreign exchange market and the sustained divergence between the 
internal and external values of the mark,! which obviously are related 
to relative prices. Also ignored in the main text are changes in the 
velocity of monetary circulation. However, in a brief discussion some 
general conditions are given under which price dispersion is neutral 
with respect to velocity changes. Finally, the only aggregate exoge- 
nous disturbances assumed to be affecting the economy are periodic 
infusions of new money made by the government. Real aggregate 
shocks are ignored. Empirically, they are probably of relatively minor 
importance and can be considered to be part of the error term in the 
estimated equations. 

The testing of the dispersion equation requires two important pre- 
liminary steps. First, a price dispersion series is computed in Section 
II using an interesting set of data. It consists of monthly averages of 
68 commodity prices, ranging from foods to metals, for the period of 
January 1921 through July 1923 (31 months). Data were unavailable 
for the months prior to January 1921. Next, a money growth equation 
is set up and estimated in Section III. To do so, both an information 
set available to agents economy-wide and a functional form relating 
this set to money creation are postulated and discussed. The ex- 
plained part of money growth in the estimated equation is taken as a 
measure of the perceived rate of monetary expansion. Corre- 
spondingly, the unexplained part is interpreted as the money growth 
rate that could not be perceived from the assumed information set. 
These figures, in conjunction with the price dispersion series, are 
used in Section IV to test the price dispersion equation. 

Parks (1978) has also tested a model of price dispersion using pre— 
and post-World War II U.S. data. In Parks’s model dispersion is 
explained by changes in real income and the unexpected part of 
inflation, as measured by the innovation in the inflation rate. In this 
specification expected inflation and changes in real income are 
treated as exogenous variables. He finds a strong positive correlation 


1 See, e.g., Bresciani-Turroni 1937. 
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between unexpected inflation and price dispersion. His tests also 
suggest a separate but smaller effect of the actual inflation rate. 

The present paper estimates an equation that relates price change 
dispersion to the exogenous shocks affecting the economy, in this case 
unperceived monetary injections. An additional monetary variable 
that is theoretically relevant for price dispersion is the variance of 
money shocks. An estimate for this variance is obtained from the 
money growth analysis and is included in the estimation. The model 
receives significant support from the empirical analysis. In particular, 
the variable measuring unperceived money growth has substantial 
explanatory power for price dispersion. The results also make clear 
that unincluded variables have important effects on price dispersion. 
Some of these are briefly considered in Section V. 


I. The Model 


The economy consists of an arbitrarily large number of physically 
separated markets indexed by z. In each location a specific commodity 
is produced and traded. At each date t the agents, assumed to be risk 
neutral, exchange money only for the commodity being traded in the 
market in which they are currently located. At date t + 1, agents 
change location at random, and the process is repeated. Consider now 
the information set available to the agents. It contains not only lagged 
values of all relevant variables, but also current information which is 
limited to the local market price P;(z), and some economy-wide shared 
knowledge about current variables related to money creation. Actual 
money growth, however, includes a random term which is assumed 
unknown. 

The supply and demand for commodity z assume the log-linear 
forms: 


yi(z) = a8(z)LP(z) — EP] + e(z), (1) 
yf) = —C Pz) — EP.) + (Mi — EP?) + (2), 0°(z) > 0,a%> 0. (2) 


The operator E is the mathematical expectation taken conditional on 
the information available in market z at time ¢. For each commodity z, 
Pz) — EP; is the locally perceived relative price. The expressions €{(z) 
and e#(z) represent relative shifts to supply and demand, respectively. 
The excess demand shift, €,(z) = e/(z) — e{(z), is assumed serially 
uncorrelated, normally distributed with zero mean and variance o?. 
This variance is assumed to be equal in all markets. For each z, a*(z) is 
the short-run relative price elasticity of supply. Disparity in the supply 
elasticities of different goods follows from heterogeneous produc- 
tion functions. However, in the long run relative prices are assumed 
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fixed because of perfect substitutability on the supply side. The long 
run is measured here by one period, after which all suppliers can shift 
to other markets. 

Looking one period ahead, all the markets offer the same mean 
price, but, as shown below, the corresponding variances differ ac- 
cording to the excess demand elasticities. Because agents are risk 
neutral, they are indifferent between the markets, and thus they 
choose a market for the next period randomly. There is an additional 
point related to the ex ante variability of the individual prices. Intui- 
tively, one would expect a market with more price variance to be less 
desirable because local information would yield price level estimates 
of lower precision. However, as shown below, this turns out not to be 
the case. 

On the demand side, the relative price elasticities are assumed 
constant across markets. The demand function also includes the term 
M, — EP,, which accounts for a real balance effect. 

At the beginning of each period the stock of money in the economy 
is increased by transfers from the government to the public. This new 
money is assumed to be distributed equally across the markets. Within 
each market, however, the transfers are allocated randomly among a 
large number of agents. The rate of growth of the money stock, m; = 
M, — M;-1, obeys 


mM, = 2BiXie + mM = gi + mM, 


where the X;;’s are variables (past or current) that can be observed in 
all locations and the £’s are known coefficients. The quantity m; is a 
random variable with zero mean and variance o%,. Thus, g; is the 
expectation about money growth formed from all the economy-wide 
shared information. It can be considered the prior expectation. The 
posterior is formed using the additional information conveyed by the 
local price. Thus, while g,; is the same everywhere, the posterior 
expectation Fm, is conditional on location-specific information as well 
and therefore varies across markets. 
From equations (1) and (2) market clearing implies that 


Pz) = {1 — 1/[a8(z) + a*)} EP, + {1/[as(z) + a@]} [M, + €,(z)]. (3) 


For each z, the sum a‘(z) + a7 is the relative price elasticity of excess 
demand. Let A(z) = 1/[a%(z) + a4]. Each market has a constant A(z), 
but across markets A(z) is distributed according to a given density 
function with average value \ and “variance” aX. Consistent with the 
assumption that agents possess accurate knowledge about the structure 
of the economy, this distribution is assumed to be known. 
Following Lucas (1973) and Barro (1976), the solution for prices in 
terms of exogenous variables is obtained using the method of unde- 
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termined coefficients. Given the log-linearity of the model, the solu- 
tion for the aggregate price level has the form 


P; = T1,M;_; a Il,¢, ap II,7,. (4) 


Namely, the aggregate price level will be related to the current money 
stock, which is divided into its different components. Lagged values, if 
added to (4), yield zero coefficients. Since M,_, and g: are fully per- 
ceived at time ¢, taking the expectation of both sides yields 


EP, = IL,M;_, + Ig, + TEM. (5) 
The conditional expectation of m; is now computed. Rewrite (3) as: 
By(z) = ge + my + e(2), 
where 
6(z) = [1/A(z)IPAz) — [1/A@) — IEP, — M,-,. 


The total disturbance affecting market z, 6(z), is partly nominal and 
partly real. Agents perceive 6(z) and form their expectations about 
its components. Given the stochastic specification of m, and €;,(z), the 
mean of the distribution of m, conditional on 6(z) is 


Cr 
Em, = gt + ot + git) =o); 


or 


2 
Em, = g, + Pade [ray + €,(z)]. (6) 
Observe that A(z) does not appear in (6). Since agents located in z 
know this elasticity, they are able to isolate the composite disturbance 
independently of A(z). Thus, while the ex ante variance of prices 
depends on the particuiar elasticity (this follows from eq. [26] below), 
the precision obtainable from the local information is independent of 
A(z). Indeed P;(z) would convey less valuable information in higher 
price variance markets if the differential variability was due to a 
disparity in 02. In this model, however, o? is the same across markets. 
Substitute now (6) into (5) and the resulting expression for EP, into 
(3) to obtain 


2 
EP, = TM; + pgs + ees Line + €r(z)I, (7) 


2 Lm, + €,(z)] 


2 
P,(z) = [1 — rA(z)] {Me + Ihg, + Is ee (8) 


+ A(z)LMiy_-1 + ge + me + €(Z))]. 
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A new expression for the general price level can be computed from 
(8) by averaging with respect to the densities of X(z) and eé,(z): 


om 


P; — ( = d) (Me. aP Il,g; + II, rt Se A(M i oe &t a mt). 


(9) 


Since equation (9) is identical to (4), the solution for II,, I,, and IT, is 
obtained by equating the corresponding coefficients in the two 
equations: 


Il, ad i; 

i (10) 
2 2 

i Om + Oe 


62, + (Id)? 


Substituting (10) into (8) and (9) and rearranging terms yields the 
solution for the individual commodity price and the average price 
level: 


o2, + A(z)(/A)o?2 


P,(z) aa My tae a o2, fe (1/A)o2 


[m, + €,(z)], (11) 
Cait Cs c 


Oe ae alee EAL eae 


(12) 


The resulting actual relative price is: 
Pi(z) — Py = (1 — @)A(Z tg + [6 + A(z)(1 — A)]eA(z), (13) 


where \(z) = A(z) — A and 6 = o2,/[o2, + (1/A)o?]. 

The hypothesis expressed by equation (13) is that only the unper- 
ceived part of money growth can affect price relationships. Note that 
the realized values of the unperceived money growth appear in the 
relative price expression. This follows from the confusion between m; 
and ,(z). Since in general Em, # m,, part of the money shocks is 
mistakenly perceived to be a shift in relative excess demand. The 
ensuing short-run supply reactions differ across markets according to 
a‘(z), thus causing dispersion among actual prices. On the other hand, 
g: 1s correctly identified as an aggregate disturbance and therefore 
cannot be confused with a relative shift of excess demand. The neu- 
trality of perceived money follows from y{(z) and y#(z) being functions 
of the relative price and from the one-to-one relationship between g; 
and the expected price level (eqq. [7] and [10]). Given some value for 
gt, the quantities along the supply-and-demand schedules are the 
same as before, for local nominal prices higher by an amount equal to 
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the adjustment in EP,—which equals gt. Therefore, the market clears 
at a P,(z) which is higher by the same degree in all markets 

The variance of relative prices at time ¢, defined as 7? = 1/N Se, 
[P,(z) — P,}?, where N is the “very large” total number of markets in 
the economy, can be computed from equation (13)3 


tT = {(1 — 6Po% + [6+ A(1 — 6) P}o2 + (1 - 0)?aim?. (14) 


An empirical test of this equation requires a measure of dispersion 
among prices or price indexes of different commodities. Mills (1927, 
chap. 3) discusses problems that the interpretation of this dispersion 
measure presents. For example, long-run differential technological 
changes will cause prices to disperse over time. One would like to filter 
out such effects, because the focus here is on short-run distortions 
caused by incomplete current information. The problem is alleviated 
by using rates of price change rather than price levels. Different trends 
do not affect the variation of price change dispersion over time— 
although alterations in these trends will. Thus, some of the long-run 
relative prive movements effects can be filtered from the dispersion 
measure. What remains can be considered to be captured by the 
random term in the dispersion equation. 

The variance of the rates of change in individual prices is calculated 


using equation (13) and the equivalent for ¢ — 1. This variance, 
defined as 4 
1 
Me =, > Pa) PG) — (PP ), (15) 
z=1 
follows as 


ve = 2(1 — 6Pogoz + 2[6 + A(1 — 6)?]o2 + (1 — 6)?aR(om, — m,_,)?. 


Equation (15) is the final price dispersion equation that is generated 


* In order to consider the effects of changes in the velocity of money circulation on 
price dispersion, I worked out a similar simple model in which there is some current 
public information about future money growth. This information may be conveyed by 
political or military events that are believed to have implications for the future state of 
government finances. The prediction of future monetary expansion, which generates 
inflationary expectations, affects the velocity of circulation in the current period. With 
respect to relative prices, if the knowledge about future money growth is shared 
economy-wide, they will be unaffected by the change in velocity. This neutral effect 
follows from the same mechanism determining the neutrality of perceived money. 
Since all agents share the same knowledge and are assumed to use it in the same model 
to predict its effects, they will equally adjust their EP, according to the change in 
velocity taking place. A “one-time jump” in all prices therefore occurs, without affecting 
their dispersion. ; 

3 For this computation, since ¢,(z) and [e,(z)}? are independent of X(z), [A(z)P, and (z), 
the following equalities are used: (1/N)Ze,(z)-A(z) = 0-A = 0, (1/N)Z lez) PIA) = 
aeax, and (1/N)2[e(z)PIA(z)P = o¢(ax + d’). 
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by the model. Because it deals with the dispersion of price changes, 
the appropriate monetary-shocks variable is the magnitude of 
changes in m,. 

Consider next the implied relationship between the variance of 
money shocks and the dispersion of relative prices. Barro’s theoretical 
result was that 0%, is positively correlated with relative price variability. 
However, the effect of o?, is ambiguous in this extended version of the 
model, since it has different and opposite effects on the three terms in 
the y? expression. The second term on the right-hand side of (15) is 
the remainder of the expression when all markets are alike; that is, 
when, as in Barro’s case, all have the same excess demand elasticity (aX 
= 0). This term corresponds to his relative price variance, which 
depends positively on o?, when 0 < A < 1. This condition is the 
counterpart to Barro’s assumption that substitution effects dominate 
wealth effects. 

The first term accounts for the positive interaction between the 
diversity in elasticities and the strength of the relative shifts. A term of 
this sort would be included also in the dispersion expression under 
full current information. In the present case of partial information, 
the fraction (1 — 6) appears here because agents typically underesti- 
mate the magnitude of the relative shifts, thus diminishing their effect 
on price dispersion. Because this underestimation increases with o7, 
the first term is negatively related to the money variance. The other, 
more interesting negative effect of o%, appears in the third term, 
namely, in the coefficient of (m; — m;_,)*. If a7, increases—or more 
precisely when the public perceives it doing so—money disturbances 
are less confused with real shifts, implying that a given shock induces 
smaller dispersion. This effect is a relative price equivalent of Lucas’s 
hypothesis about the link between the variance of the nominal distur- 
bances and the slope of the Phillips curve. 

In the testing of equation (15), reported in Section IV, an attempt is 
made to capture the different effects of o?, and its net influence on 
price dispersion. However, the procedure adopted does not indicate 
that shifts in o7, have an important effect. 


II. Construction of the Price Dispersion Series 


This section reports the computation of a measure of price dispersion 
for the hyperinflation in Germany during the period January 1921- 
July 1923. The data set, consisting of 68 series of monthly averages of 
wholesale commodity prices, is obtained from the German statistical 
yearbook issues of 1921/22 and 1923 (see Statistisches Reichsamt 1921/22 
and 1923). Other series from this source, some reported only until 
December 1921 (seven commodities) and others beginning only in 
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January 1922 (21 commodities), were deleted in order not to intro- 
duce a bias due to changes in the sample size and composition. 

Prices are quoted from commodity exchanges of several German 
cities. Each series, however, originates in a single location. The 68 
commodities include 27 foodstuffs, 19 textiles and leathers, and 22 
metals, oils, and coals. They are not finished goods but materials in a 
rather raw state. Because weights for the different commodities are 
unfortunately not available, unweighted rates of price change are 
used. Hopefully, the wide range of commodities in the sample ap- 
proximates the general relative price instability during that period. 

The individual price rates of change are computed as the first 
difference of the logarithms of the prices. Average values and vari- 
ances are then calculated using 


1 
AP, = 37> AP in, 


1 
Yi = vv (laa (AP) 


where Pj; is the price of commodity 2 and AP = log Py, — log Piz-1. 
Table 1 contains the computed values of AP; and yi. Due to missing 
observations, the actual number of commodities included in the cal- 
culations varies slightly from month to month. The third column in 
table 1 indicates the number of commodities for which both P, and 
Piz-1 are available. 


III. Estimation of the Unperceived Part of Money Growth 


Determination of the unperceived component of money growth dur- 
ing the hyperinflation requires a specification of the information set 
assumed to have been available to the public and the functional form 
for calculating the conditional expectation of money growth. Con- 
sider the expectation conditioned on economy-wide or “global” infor- 
mation g;. This term was defined in Section I to be the prior expecta- 
tion and is distinguished from the posterior expectation because it 
does not incorporate the additional information derived from local 
price observations. 

This global information is assumed to consist of the current gov- 
ernment spending in foreign exchange units, S,, the current exchange 
rate, é,,and l-month lagged data on the money stock, price level, and 
all other macroeconomic variables. Not included is government reve- 


* Given this source, these data do not present the problem of reported wholesale 
price data in the United States, discussed by Stigler and Kindahl (1970), that they do 
not always reflect discounts from list prices. 
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TABLE 1 


MEAN AND VARIANCE OF WHOLESALE RATES OF PRICE 
Cuances, GERMANY, Fespruary 1921—Jury 1923 








Number of 





Month AP, ¥? Commodities 
1921: 
February an09 015 63 
March —.05 011 66 
April 202 .007 66 
May Oil .026 66 
June 05 .032 67 
July .04 .034 66 
August 19 .081 66 
September ag 033 68 
October 24 .034 68 
November 38 .043 68 
December ns) .062 66 
1922: 
January 05 .018 66 
February ah 013 68 
March 25 O11 65 
April alt2 .018 65 
May .04 .013 66 
June .09 .007 66 
July 36 017 67 
August .70 097 68 
September .40 071 66 
October .68 .064 66 
November 78 .038 66 
December a2 .052 66 
1923: 
January 70 .064 66 
February .69 .099 66 
March — 17 .041 65 
April ald .018 65 
May 50 .039 66 
June 82 048 62 
July 1.25 15) 63 








Source.—Based on monthly average price data from Statistisches Jahrbuch fiir das Deutsche Reich (1921/22), pp. 
282-83, and (1923), pp. 286-89. 


nue from taxation and other sources because this variable depends on 
the current level of economic activity and is unlikely to be prean- 
nounced and to be widely known contemporaneously. It is natural to 
assume that the part of government expenditure consisting of the 
reparations to the Allied Powers was known in foreign exchange 
terms. With respect to the other expenditures, the implication is that 
nominal spending was observable and could be readily converted 
given the exchange rate.° 


5 A referee of this Journal suggested an alternative procedure for isolating unper- 
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The prior expectation of money growth is derived from the govy- 
ernment monthly budget constraint, namely, 


M? — M?_, = Ste? me 
— (other forms of nominal government revenue). to) 


The superscript 0 indicates that the variables are not in logs but in 
their original form. Equation (16) indicates that creation of high- 
powered money equals the part of nominal expenditure that is not 
financed in some other way. The expression M? would correspond 
here to the end-of-month money stock. The other forms of govern- 
ment finance are taxes, net sale of bills, gold sold to the public, etc. 

If this other revenue comprised an approximately fixed proportion 
of total expenditure over time, the budget equation above could be 
expressed as 


M? — M?_, = kS%° + random term, (17) 


where k (0 < k < 1) is the average fraction of the expenditure 
financed by money issue. 

The first attempt to generate a perceived money growth series was 
made using an equation of this type. Dividing (17) through by M?_,, 
money growth appears linearly related to S%e°/M?_,. The three vari- 
ables in this ratio are assumed currently known, and, therefore, this 
specification is consistent with the notion that the conditional expec- 
tation can be formed using only currently observable variables. 

However, a regression of this form,® including a constant, shows 
that k was probably not constant over the period. Specifically, the 
existence and pattern of residual serial correlation,’ plus some addi- 
tional considerations discussed below, suggest a nonlinear relation- 
ship between money issue and spending during that period. 

Assuming, then, that the fraction k is not constant over time, the 
question is whether something can be said about its determinants. In 


ceived money using information from the Central Bank’s balance sheet. This informa- 
tion is available monthly in Statistisches Reichsamt (1925). When the Central Bank 
discounted a bill, the corresponding credit appeared in the current account of the state 
or the private borrower. Only when these funds were withdrawn did the notes in 
circulation increase. Using information about these credits, agents could forecast 
money growth in the near future. 

* The estimated OLS equation is the following (see below for the inclusion of the 
lagged spending variable): 


Pip MEA 49+ 317 SFL 4 gay See 
ai (.019) (033) MP1 (ogg) Mia 
(R? = .95, D-W = 1.0, o = .067). 


"The residuals are generally negative at the beginning and the end of the period, 
approximately the low and high values of money growth, and generally positive for the 
rest of the sample. 
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order to suggest an answer to this question, rewrite equation (17) as 
Mie Mi S? 
ttl _ = fk, ——__*____ + random term. (18) 
M? : Mle? 


Equation (18) preserves the positive correlation between money 
growth and the ratio of real expenditure to real cash balances, but, 
unlike (17), the fraction k is now allowed to vary over time. It is now 
argued that k; is itself correlated with M?/e? and S?. 

To examine this correlation, assume first that S? is fixed at some 
value S°. This level of real spending can be financed by different 
mixes of inflationary finance on the one hand, and taxation, debt 
issue, etc., on the other, where the amount to be collected by money 
issue is expressed as k,S°. In the usual diagram plotting the demand 
for real balances as a function of the inflation rate, k,S° is measured in 
steady states by the area of the rectangle defined by 4—the inflation 
rate—and M?/e?. 

Consider now an increment in m. Real balances decline according to 
the money demand function, and the revenue from inflation, k,S°, 
increases as long as w is below the rate that corresponds to a unitary 
demand elasticity for real balances. Because real spending is constant, 
k, increases, and, hence, the fraction of S° financed by other means 
declines. 

This shift from taxation to money issue can be viewed as the policy 
variable that brings about higher inflation rates. Classic works on the 
German hyperinflation, like those by Graham (1930) and Bresciani- 
Turroni (1937), describe an opposite direction of effect. Namely, the 
rate of depreciation of the currency had a negative effect on the real 
yield from taxation due to the interval of time existing between the 
occurrence of taxable transactions and the actual payment of the 
taxes.* The present discussion relies on the correlation between the 
fraction of expenditure financed by money issue and the inflation rate 
rather than on a specific mechanism relating these two variables. This 
positive correlation implies that k, and 1/(M&/e?) move in the same 
direction. However, this coincidental movement does not hold for all 
wu. When w reaches the rate that maximizes the revenue from 
inflation, k, also reaches its highest level, and when it rises above that 
rate, k; must decline. In other words, the correlation between k, and 
1/(M@/e?) turns negative in that range. 

This decline in k; implies that the revenue from other sources must 
go up. If tax collection and debt issue cannot be increased (é-0;, duéto 
the negative effect of inflation mentioned above), spending must be 
partially financed by extraordinary means, such as sales of gold from 


* See, e.g., Bresciani-Turroni 1937, p. 66; and Graham 1930, p. 44. 
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the Central Bank’s stock. In fact, the balance sheet of the German 
Central Bank shows that the stock of gold begins to decline 
significantly in April 1923, after being fairly stable since 1920.9 

Given this behavior of k, when S9 is constant, equation (18) can be 
approximated by the semilogarithmic form: 


M? >. M?1 


¥ , ] ; 
mM = constant + b’ log eae Sain (19) 


where b’ is a positive coefficient, u’ is a random term of zero mean, 
and the constant term is affected by the level of §°. In this specifica- 
tion, the implicit fraction k, increases along with 1/(M?/e?) at lower and 
middle ranges of this variable but eventually declines when real bal- 
ances fall below a certain value. 

Equation (19) acquires more empirical content if real spending is 
allowed to vary. During the period under study, S? has a declining 
trend. However, holding constant the real balances term, which has a 
strong correlation with time, movements in S? can be interpreted as 
temporary deviations from a “normal” trend. These fluctuations are 
assumed also to be correlated with the fraction k,. The assumption 
here is that given relatively high costs associated with temporary shifts 
in tax collection and debt issue, transitory movements in spending 
would be financed primarily by adjustments in money issue. A posi- 
tive correlation between S? and k, would then result. However, a 
sufficiently high value of S? could be presumed to require extraordi- 
nary finance of the sort previously mentioned, so that k, might even- 
tually decline. 

Incorporating an approximation of this effect into equation (19) 
results in the following generalized expression: 


M? i M?-1 


Mm =a'+b)’' log ( 


| Oe aes 
aimee log S? + u', 


which can be rewritten as 


Mi Mit = a! — 6'(Mys + (Mr Me) ed) +S; +u', (20) 
t 
where variables without a superscript are again in logarithmic terms. 
In order to proceed with the formulation of the prior expectation, 
it is convenient to replace the logarithmic growth rate (M,; — M,_,) on 
the right-hand side by the growth rate measured by (M? — M?_,)/M?. 
While this rate is always lower than M; — M,_,, the gap widening the 
higher the growth rates, this effect can hopefully be captured ap- 
proximately by the coefficients in the estimated equation. Then, 


® See Statistisches Reichsamt 1925, p. 53. 
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equation (20) can be solved for (M? — M?_,)/M? to yield 


0_ yo ] 
a ae =e) + Sit (21) 
t 





I : T+5 Tee 


The prior conditional expectation is defined accordingly as 


el ee ee er a 


The unperceived part of money growth m;’° is then computed by the 
difference between actual growth and g;, namely, 


= (M? — M?_,)/M? — ge 


The coefficients in equation (22) are those which result from re- 
gressing (M? — M?_,)/M? on S; and (M;_, — e). However, the exchange 
rate is not in general an exogenous variable in a money growth 
equation. A correlation between e, and the error term 4% will exist via 
some unspecified condition for equilibrium in the foreign exchange 
market. Therefore, the coefficients in (22) do not correspond exactly 
to those in equation (21). This property is not a drawback. On the 
contrary, the bias in the estimated coefficients (relative to those in 
[21]) reflects the part of u, that can be estimated from e;. It therefore 
should be taken into account in calculating g;."" 

Turn now to the estimation of equation (21). There is a problem in 
matching the available data on money with those on prices for the 
German hyperinflation. Unlike the price series, which consist of 
monthly averages, the available data on the money stock until January 
1923 are end-of-month figures.’”? From January 1923 onward, four 
quotations per month are available. Thus, a proxy is constructed for 
the monthly average money stock. For the period January—July 1923 
it contains averages of the beginning-of-month, end-of-month, and 
the three intermediate quotations available. Until December 1922, the 
monthly averages are approximated by linear interpolation of the 
end-of-month figures. The constructed series are shown in table 2. 

A further consideration arises. The estimation of equation (21) 
from monthly average data on the money stock, rather than end-of- 


10'The unperceived growth m, does not correspond exactly to the error term u,. On 
this point, see below. 

‘1 E.g., in the general linear model y = XB + u, where the variables in X are correlated 
with w, the estimated vector of coefficients is B = (X'X)- 1X'y = (X'X)1X'(XB + wu), or B 

= B + (X'X)'X'u, where (X'X)'X'w is the regression coefficient of u on X. The 
mredicnen of y given the values of the vector x; is, accordingly, y, = x¢[B + (X'X)1X'u]; 
i.e., it is composed of the systematic part x/B, plus the conditional expectation of u; given 
Xe. 

2 Another problem stems from the form of the prior expectation, which, as it stands, 
requires the use of end-of-month money stocks. See the discussion below. 


DISPERSION OF RELATIVE PRICES 343 


TABLE 2 


VaLuEs (in Logarithms) or REAL EXPENDITURE, 
EXCHANGE RaTE, AND Money Stock 














S; et M, 

Oct. 1920 5.85 2.79 11.24 
Nov. 6.44 2.91 ee 2 5 
Dec. 6.58 2.86 11.28 
Jan. 1921 6.42 2.74 11.29 
Feb. 6.80 2.68 11.28 
Mar. 6.66 2.70 11.29 
Apr. 6.65 2.71 11.30 
May 6.53 2.69 11.30 
June 6.76 2.80 11.33 
July 6.43 2.91 11.36 
Aug. 6.76 3.00 11.38 
Sept. 6.23 eee 11.42 
Oct. 5.95 3.58 11.48 
Nov. 5.54 4.14 11.55 
Dec. 6.44 3.82 11.66 
Jan. 1922 5.96 3.82 11.72 
Feb. 5.84 3.90 11.75 
Mar. 5.85 4.22 11.81 
Apr. 5.77 4.24 11.88 
May 5.93 4.24 11.96 
June 5.75 4.33 12.05 
July 5.34 4.77 12.16 
Aug. 5.32 5.60 12.33 
Sept. 6.08 5.86 12.58 
Oct. 5.60 6.63 12.92 
Nov. 5.30 7.44 13.35 
Dec. 6.08 7.50 13.85 
Jan. 1923 5.32 8.36 14.26 
Feb. 5:63 8.80 14.82 
Mar. 6.55 8.53 15.31 
Apr. 6.15 8.67 15.60 
May 5.65 9.34 15.83 
June 6.21 10.17 16.29 
July 6.16 11.34 17.14 

Note.—S, = log of the monthly government expenditure in millions of gold marks. Source: Bresciani-Turroni 


(1937), pp. 436-37. e, = log of the monthly average exchange rate of the gold marks in paper marks. 
Source: Bresciani-Turroni (1937), p. 441. M; = log of the monthly average money stock in millions of paper marks. 
Source: based on fixed days quotations from Sonderhefte zur Wirtschaft und Statistic (1925), pp. 45-47. 


month figures, means that both the current month’s spending and 
that of the previous month should be considered. Spending financed 
by money issue during the previous month increases one to one the 
current monthly average stock but has, in general, a weaker effect on 
the prior month’s average stock." To account for this effect, lagged 


* This effect can be seen by considering first a case where each month’s spending is 
spread evenly over the month, and, say, it is financed only by the issue of new money. In 
this case the money stock grows linearly at, in general, different rates within each 
monthly period. Then, if M?_2,enq denotes the money stock at the end of month t — 2, 
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spending is incorporated into the framework of the semilogarithmic 
function in equation (21). First, define the variables Sj and e; by S; = 
logdésy 47 (p= €)Soe tand<c=—slog [e8ears(1 = €)S?_,]. After 
substituting S; and e; for S, and e, in (21), € is estimated, simulta- 
neously with the other coefficients in the equation, using a nonlinear 
maximum likelihood procedure under normally distributed errors.!4 


The estimated nonlinear equation is 


M ie MESS YEG0 = 166 (i ef) PLS: 23) 
ako aan : (Mig 1) 1h - t ( 
: (.071) (.007) (.014) 
(2775, 27 = 937D-W — 16370 — .026, 33 observations), 
(.11) 


where the numbers in parentheses are the standard errors of the 
coefficients. With respect to the number of observations, the starting 
month was taken as November 1920 in order to test lagged effects of 
monetary shocks on price dispersions. According to the argument of 
note 13 above, the value .75 for € suggests a pattern of spending that 
is biased toward the beginning of the month. In order to proceed with 
the analysis on the more familiar ground of linear equations estima- 
tion, € is assumed henceforth to equal .75. Given this value, the 
standard errors of the other coefficients, linearly estimated, are not 
materially different from those obtained above. In order of their 
appearance in (23), they are .068, .005, and .014. The standard error 
of the regression is now 0.0256. The other regression statistics and the 
coefficients remain obviously the same. 

The pattern of the residuals from equation (23), which are reported 
in table 4, suggests that their variance increased during the sample 
period as inflation progressed. A method similar to that proposed by 
Glejser (1969) is adopted to correct for the apparent heteroscedastic- 
ity by assuming a specific model for the variance of the error term. In 
this procedure the variance is postulated to be determined by a set of 
variables {z;} in the linear form: 


Cae X wzir (24) 


the monthly average for ¢ — 1 equals M;-2,na + ¥2 S?_,e?-1, and that corresponding to 
month t equals M?_»,ena + S?_1, e?-1 + % S?e?. Thus, the increase in the monthly average 
from t — | tot equals 4S? _,e? , + 4 S%e?. Namely, spending evenly spread over each 
month would imply equal weights for current and lagged spending in the money 
growth equation. Alternatively, if spending is concentrated at the beginning of the 
month, the relative weight of lagged expenditure would be lower. At the extreme, e.g., 
if all spending is made only on the first day of each month, both t — 1 and ¢ monthly 
averages increase equally by the amount of the ¢ — 1 expenditure. In this case lagged 
spending does not belong in the money growth equation. 

14 This procedure is from the TSP Regression Package. 
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If the values of the true money shocks m# were available, one could 
use them as follows. Since the expectation of mj? is 02, it follows that 


Me — oe 1 Op 
where v, is of zero mean. Combining the last two equations yields 
mF = Lorir ate (25) 


Estimates of the coefficients in equation (24) could be obtained by 
regressing m7? on the z; variables. The heteroscedasticity problem is 
also present here, but it will be ignored in what follows. 

The values of mj are, however, unknown; only the estimated re- 
siduals m; are available from the OLS money growth equation. Since 
m: converges to m#? asymptotically, the variance estimated using m? 
values obtained from small samples will be biased. This bias is ne- 
glected, as it is probably of relatively small importance. 

In order to proceed with the implementation of this procedure the 
set of the z; variables must be specified. The presumption is that the 
same variables used to explain the growth rates are also correlated 
with the variances. Thus, S/, e/, and M,-; are candidates. The lagged 
squared residual m?_, is also included as an explanatory variable. It 
presumably captures the effect of serially correlated omitted variables 
and perhaps a direct correlation between oO 7-, and the current vari- 
ance. The estimated variance equation is 


— .007 — .00038S; — .00026¢; + .00090M,_, — .28172_, 
(.003) (.00047)  (.00031) —(.00047) (.130) (26) 


(R? = .57, D-W = 2.5, o = .001, 33 observations).!5 


mi 


Nothing in this procedure for estimating the series of money vari- 
ances guarantees that all the fitted values from equation (26) would be 
positive. Indeed, two of the fitted values have a negative sign. In order 
to use the estimated series as a measure of variances, these two 
negative values are replaced with the smallest positive value in the 
series. The series of the square roots of these estimates are reported in 
column 4 of table 4. 

Using this series as weights for the corresponding observations, 
equation (23) is reestimated with the following results: 


(M? — M?_,)/M? = .700 — .154(M,_, — e/) + .0945/ 
(.047) (.006) (.012) (27) 
(R? = .95, D-W = 1.8, 6 = 1.01, 33 observations). 
* In order to estimate the 33 variances needed to reestimate eq. (34), the 7, series 


were obtained from running the money growth equation after adding the additional 
observation of October 1920. 
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Observe that the coefficient of S; here is somewhat lower than in the 
OLS equation and that the coefficient of (My-1 — ef) 1s somewhat 
higher. The general form of the equation is, however, robust to this 
transformation of the data. 

The next step is to test the stability of the coefficients in equation 
(27) across two subperiods. Stability of the coefficients has particular 
relevance here. If the equation is approximately stable, it seems easier 
to assume that it was known from the beginning of the period and 
that perceptions about money growth were formed using the same 
equation during the entire sample.!® The period is divided into an 
approximately nonaccelerating money supply period until May 1922 
and an accelerating phase beginning in June 1922. An F-test applied 
to these two subperiods yields the statistic Fe, = 1-7, witha. corre- 
sponding 5 percent critical value of 3.0. Therefore, the hypothesis of 
stable coefficients across these two subperiods cannot be rejected at 
the 5 percent significance level. 

A regression in which the coefficients of M,;-, and e; are uncon- 
strained produces coefficients of similar magnitude for the two vari- 
ables. The F-test for the linear constraint of equal coefficients yielded 
the statistic Fig = 2.3, where the 5 percent critical value is 4.2. 


IV. Empirical Test of the Dispersion Equation 


The tests of the price dispersion model in equation (15) are= per- 
formed using the dispersion series computed in Section II and the 
unperceived monetary shocks as measured by the residuals in equa- 
tion (27). 


For convenience equation (15) is rewritten here 


y? = {2(1 — 0)'o% + 2[6 + A(1 — O)1?}o2 + (1 — OP OR CM — Mes)”. 
(15) 


Significantly, this equation has a simple linear form—and can 
therefore be tested by an ordinary least-squares procedure—only 
under constant money and relative shocks variances. However, the 
analysis of money growth in the previous section suggested that a7, 
increased during the hyperinflation. If this is indeed the case, it would 
not be appropriate to test the model with a specification that relates 
price dispersion to money shocks with a constant coefficient. 

Two different procedures are adopted here to deal with the possi- 
bility of a changing money variance. The first uses a linear approxi- 
mation in which the variance of money—as measured by the G7, series 


16 Estimation of a money growth equation in a similar context, using the entire 
sample (for U.S., 1941-73), was discussed and performed in Barro (1977). 
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from Section III—is kept constant by including it additively in the 
equation. As discussed in Section I, 2, has different and opposite 
effects on y3, and, therefore, on an a priori basis the coefficient of 62, 
could take either sign. The other attempt is to estimate (15) as a 
nonlinear equation. The results of this procedure, reported later in 
the section, are quite poor. 

The estimated equation in which 62, is added linearly is: 


yt = .033 + 17.4(m, — my)? — 15.862, (28) 
CULO). (3.2) (8.6) 


(= 309, DW = 11: 25G = 022.30 observations). 


The monetary shocks appear to have considerable explanatory 
power for price dispersion. The coefficient of 62, is negative and, 
therefore, suggests a dominant Lucas-type effect of the money vari- 
ance on price dispersion. That is, the degree of dispersion associated 
with given shocks diminishes the higher their variance. The ex- 
planatory power of G3, however, is fairly low; its coefficient is 
significantly different from zero at the 5 percent level but fails to be so 
at the 2.5 percent level. 

Theoretically, the 1-month lagged money variance 62,_, belongs 
also in the equation. However, when included, its coefficient is in- 
significant with a ¢-statistic of .8. 

The Durbin-Watson statistic indicates autocorrelated residuals, 
which may be caused by omitted real variables (like changes in the 
pattern of government spending,'’ in income distribution, etc.) that 
are serially correlated or by the fact that the m, variable includes an 
estimation error. In order to check whether the degree of significance 
of the estimated coefficients in (28) is affected by this autocorrelation, 
the equation is reestimated using the Cochrane-Orcutt technique. 
The results are quite similar to those obtained before: 


y7 = .038 + 16.8(m, — m,_,)? — 18.562, (29) 
(.008) (2.9) (9.8) 


(R? = .67, D-W = 2.2, 0 = .020, 29 observations, 6 = .36). 
17) 


Including 6;,,_; in this regression yielded a t-statistic of only 1.0 for 
its coefficient. The possibility of lagged effects of monetary shocks on 
price dispersion was explored by including the variable (m,_, — m_2)? 
in the equation, but the estimated coefficient was found statistically 
insignificant. The ¢-ratio was 0.3 in the OLS regression and 1.4 using 
the Cochrane-Orcutt technique. 


‘’ The magnitude of changes in the amount of real government spending, however, 
does not have any significant explanatory power. 
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The next step is to see whether the dispersion equation is stable 
over the entire period. In fact, the inclusion of the 6?,, variable is an 
attempt to control one source of instability in the coefficients. In order 
to carry out this test the sample is divided first after May 1922, 
estimating the equation separately for the two subperiods. This parti- 
tion of the sample is the same one adopted previously to test the 
stability of the money growth equation. Then, the exercise is re- 
peated, dividing the sample at the end of 1922. The aim of this 
partition is to see how the model performs after removing from the 
sample the 7 months of 1923, which had a much more unstable 
monetary growth. 

The results of these regressions, reported in table 3, can be sum- 
marized as follows. When the sample is divided in May 1922 (first 
column), the results for the first subperiod are fairly weak. Both 
coefficients are insignificant in the OLS equation. Using the 
Cochrane-Orcutt technique the coefficient of (m; — m;,-,)” turns out 
significant at the 5 percent level, although that corresponding to Gj; is 
still insignificant. During the second subperiod—from June 1922 to 
July 1923—the statistical performance of the equation is much 
stronger. 

The second column reports the equations estimated for the periods 
through and after December 1922. Observe that the removal of the 
1923 portion of the sample worsens the performance of the model as 
judged by the size of the ¢t-ratios. However, the coefficient of the 
monetary shocks still remains quite significant. 

Formal F-tests fail to reject the hypothesis of stable coefficients 
across the mentioned subperiods. When the sample is divided in May 
1922, the resulting statistic is F3, = 1.8, while the 5 percent critical 
value is 3.0. Partitioning the sample at the end of 1922 yields the 
statistic of only 0.8. 

The other approach adopted to test the dispersion equation was to 
treat it as a nonlinear relationship. When o%, is changing over time, 
equation (15) generalizes to 


2 2 2 2: 
FeO Oe Oe 
SST OPS ee te eee 
a 208 lager eae Gara 


\Om + 2 AOmn—-1 + 2 
2 2 2 
+O eee iy Ae Se aaek 
Neh da C2 hh Re hae tlio alte 


As discussed in Section I, the third term in this equation reflects the 
negative effect of o7, on the impact of monetary shocks. The second 
term corresponds to Barro’s (1976) relative price variance expression 
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that is affected positively by o% when 0 < A < 1. Recall that A is 
defined as the average of 1/[a‘(z) + a] across markets, and a*(z) + at 
is the excess demand elasticity of commodity z. Using again the 67, 
series, the parameters of equation (30)—A, o2, and o%—are estimated 
using a nonlinear least-squares procedure with fairly weak results. 
The additional structure given to the equation seems to be rejected by 
the data, as judged by a higher sum of squared errors than in the 
linear equation. 

Observe that a value of zero for \ reduces the equation to the linear 
form of equation (15), in which the money variance is constant. The 
estimated value for this parameter was 0.056 with a standard error of 
0.065.!® The interpretation of this result is not that the average 1/ 
[a%(z) + a] is likely to be close to zero. Instead, it suggests that the 
detailed specification of equation (30) is too stringent. For example, if 
the variance of the relative excess demand shifts, a2 also changes over 
time; this is more of a problem in this approach, since o? itself is being 
estimated as a constant. 


V. Actual Money Growth and Inflation in the 
Dispersion Equation 


In this section additional variables that were mentioned in the litera- 
ture as being related to price dispersion are tried in the equation. 
There is no rigorous theoretical justification for their inclusion. Thus, 
only loose verbal explanations are given. Also, the variables related to 
inflation are clearly not exogenous, and, therefore, any observed 
correlation cannot imply causality. 

Actual money growth and price dispersion.—A variable that can be 
considered exogenous and, if one assumption of Section I is violated, 
in principle also relevant for price dispersion is the actual money 
growth. Changes in the money stock can affect the dispersion of 
prices (even when perceived) if the new money is spread unevenly 
across the economy, thereby affecting relative demand in different 
sectors. This type of effect was discussed by Cairnes (1873) with 
respect to gold discoveries. In the framework of the present model 
this type of effect could be represented by changes in the relative 
excess demand variance o?. Since here the focus is on price change 
dispersion, the corresponding variable in this context is the change in 
the growth rate or the degree of acceleration/deceleration in the 
money stock. 

In order to test this sort of effect, and also to see whether (m, — 


18 G, was 17.7 with a standard error of 8.4 and 62 was 0.0008, with a standard error of 
0.0004. 
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m1)” is only a proxy for changes in actual money growth, price 
dispersion was regressed on [(M? — M®_,)/M? — (Mi. — M3_.)/M°_,)? 
with the following results: 





yi = .033 +24 a pene ei ~eay 
(.006) (.6) M? Me, 


(R? = .33, D-W = 1.5,0 = -028, 30 observations). 


The monetary acceleration/deceleration variable has a statistically 
significant correlation with price dispersion. Remarkably, however, its 
explanatory power vanishes when (mM, — mr)? and G2, are also in- 
cluded in the regression. The equation including the three variables is 


3 == .034 Ste 16.6(m, —s. Oise = 16502; to 


(4.0) (9.0) 
25 a = 4 Mees od 
(77)\ Me M‘-1 


(R? = .59, D-W = 1.3, o = .022, 30 observations). 


This result denies the existence of any effect of relative demands 
following the introduction of new money during this period. It sup- 
ports the hypothesis that money affects relative prices only if it is 
currently unperceived. 

Inflation and price dispersion.—In his analysis of price behavior dur- 
ing the hyperinflation, Graham describes a positive correlation be- 
tween the acceleration of price level and the dispersion of prices. 
Mills’s findings in his study of U.S. prices suggest that dispersion is 
positively correlated with both acceleration and deceleration of the 
price level. 

If the acceleration/deceleration in the price level is related to an 
unperceived monetary expansion or contraction, the theory tested 
here predicts that the correlation mentioned above should be cap- 
tured by a variable measuring unperceived money growth. To test 
whether this is the case here, the variable (1, — Mz-1)” Is also included in 
the equation—where y, is the inflation rate from month t — 1 to 
month ¢ computed from the wholesale price index (see table 4). The 
results suggest that there is a separate correlation between (u, — y,_ we 
and price dispersion. The equation estimated by OLS is 


yi = .032 + 18.6(m;, = M1)" 7 24.267, =e O74 (uy — Pay)” 
(005) (3.0) (8.7) (.031) 


(R? = .67, D-W = .9, o = .020, 30 observations). 


Given the low D-W statistic, the equation was reestimated by 
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Cochrane-Orcutt: 


yi a .033 Sia 19.5(m; a My)" a 95.3624 ae O77 (We + (tea) 
(.008) (2.6) (9.0) (.022) 


(R? = .77, D-W = 2.00, o = .017, 29 observations, p = .49). 
(.16) 


Not only does (4; — Mr-1)” have a statistically significant correlation 
with price dispersion, but its inclusion in the equation also sharpens 
the performance of the original two variables. Possible explanations 
of this correlation could be related, for example, to income redis- 
tribution following from unanticipated inflation or substitution be- 
tween money and certain commodities as stores of value, when their 
relative costs change. 

Sheshinski and Weiss (1977) consider a model of a monopolistic 
firm in which costs involved in changing the price of the commodity 
produced generate discrete periodic price adjustments whose mag- 
nitude increases with the inflation rate. They suggest that, if the 
timing of these adjustments is independent across firms, higher 
inflation rates imply larger dispersion of individual price changes. 
Hence, this type of argument seems to imply a relationship between 
price change dispersion and 47." 

A role for 4? may be rationalized also on different grounds. 
Graham mentions the effect of the interaction between government 
regulations and inflation on the dispersion of prices. He refers to 
price controls, such as rent restriction legislation, which generate 
dispersion among the differently affected prices as the general price 
level increases. According to this argument, the dispersion of prices 
will be positively correlated with the price level. Hence, the dispersion 
of price changes will increase with the magnitude of changes in the 
price level—namely with 7. 

Adding yu? to the equation yields the following results: 


8 However, the empirical implications of Sheshinski and Weiss’s analysis for price 
dispersion do not seem clear to me. An ambiguity arises because of the probably 
positive effect of inflation on the frequency of price changes that they derived. If the 
length of the observation period is kept constant, a higher frequency of price change 
may diminish the measured dispersion of price changes. The possibility of a negative 
effect of inflation on price dispersion in this framework can be seen in the following 
example. Assume that the optimal frequency of price adjustments for all firms is 2 
months and that part of the firms adjust their prices during odd-numbered months and 
the rest during even-numbered months. Using monthly data, dispersion of price 
changes will depend on the magnitude of price changes corresponding to the group of 
firms currently adjusting prices. Now assume that inflation increases; as a consequence 
the magnitude of price adjustments goes up, and also the optimal frequency is 
increased—say to one per month. Since now all the firms adjust prices during the same 
month, the dispersion of price changes collapses to zero, in spite of the larger individual 
price changes. 
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yi = 031 + 12.2(ni, — my)? — 21.362, + .062(u, — pay)? + 03603 
(.005) (4.1) (8.2) (.030) (.017) 


(R? = .67, D-W = 1.3, o = .019, 30 observations). 


The coefficients of uw? and (4; — p;-1)? are both significant at the 2.5 
percent level. However, when the equation is reestimated by 
Cochrane-Orcutt, the effect of «4? becomes insignificant (¢-ratio of 
0.8), while the performance of the rest of the variables improves. If 
the monetary acceleration/deceleration variable is also included, its 
coefficient turns out negative but insignificant at the 5 percent level 
(t-ratio of 1.62 with OLS and 1.55 with Cochrane-Orcutt), without 
materially affecting the other coefficients and standard errors. 


VI. Summary and Conclusions 


This paper tested a model of price dispersion using data on the 
German hyperinflation after World War I. The theoretical frame- 
work, outlined in Section I, predicted that money growth causes 
price dispersion only if it is currently unperceived. 

In order to perform this test, the first task was to construct a 
measure of price dispersion based on a broad enough range of com- 
modities to approximate the varying degree of relative price vari- 
ability during that period. This calculation was accomplished using 
wholesale price data on a sample of over 60 commodities, including 
foodstuffs, textiles, metals, and fuels. 

The next step was the delicate one of measuring the perceived part 
of money growth. This estimation implied the postulation of an in- 
formation set available at each point in time and also a function 
relating the variables in this set to the creation of money. The func- 
tion used to isolate the perceived part of money growth was specified 
based on considerations related to the government demand for reve- 
nue to finance expenditure. 

A measure of the variance of the unperceived money growth was 
also estimated. This series was used for dealing with the hetero- 
scedasticity problem in the money growth equation and also as a 
theoretically relevant factor affecting price dispersion. 

The testing of the dispersion equation showed a statistically 
significant correlation between unperceived money—as measured 
here—and price dispersion. Thus, the model is supported to some 
extent by the empirical results. The variance of the money shocks 
turned out to have a negative coefficient which was only marginally 
significant. Although the money variance has different and opposite 
theoretical effects on price dispersion, this negative correlation is 
consistent with a dominant effect of diminishing the misperceptions of 


356 JOURNAL OF POLITICAL ECONOMY 


monetary disturbances as relative shifts—and thus reducing the 
dispersion associated with those disturbances. 

In Section V actual money growth was also considered as a price 
dispersion factor, justified as affecting relative demands. However, 
the results indicated no additional effect for the rate of money cre- 
ation. The magnitude of changes in the inflation rate does appear to 
have a separate correlation with price change dispersion. The present 
model, however, does not provide an explanation for this association. 


References 


Barro, Robert J. “Rational Expectations and the Role of Monetary Policy.” J. 
Monetary Econ. 2 (January 1976): 1-32. 

. “Unanticipated Money Growth and Unemployment in the United 
States.” A.E.R. 67 (March 1977): 101-15. 

Bresciani-Turroni, Costantino. The Economics of Inflation. London: Allen & 
Unwin, 1937. 

Cairnes, John E. “The Course of Depreciation.” In Essays in Political Economy: 
Theoretical and Applied. London: Macmillan, 1873. 

Glejser, H. “A New Test for Heteroskedasticity.” J. American Statis. Assoc. 64 
(March 1969): 316-23. 

Graham, Frank D. Exchange, Prices and Production in Hyper-Inflation: Germany 
1920-1923. Princeton, N.J.: Princeton Univ. Press, 1930. 

Lucas, Robert E., Jr. “Some International Evidence on Output-Inflation 
Tradeoffs.” 4.E.R. 63 (June 1973): 326-34. 

Mills, Frederick C. The Behavior of Prices. New York: Arno (for Nat. Bur. 
Econ. Res.), 1927. 

Parks, Richard W. “Inflation and Relative Price Variability.” ].P.E. 86, no. 1 
(February 1978): 79-96. 

Phelps, Edmund S. “Introduction: The New Microeconomics in Employment 
and Inflation Theory.” In Microeconomic Foundations of Employment and 
Inflation Theory, by Edmund S. Phelps et al. New York: Norton, 1970. 

Sheshinski, Eytan, and Weiss, Yoram. “Inflation and Costs of Price Adyjust- 
ment.” Rev. Econ. Studies 44 (June 1977): 287-303. 

Statistisches Reichsamt. Statistisches Jahrbuch fiir das deutsche Reich. Berlin: 
Hobbung, 1921/22 and 1923. 

. Sonderhefte zur Wirtschaft und Statistic: Zahlen zur Geldenwertung in 
Deutschland 1914 bis 1923. Berlin: Hobbung, 1925. 

Stigler, George J., and Kindahl, James K. The Behavior of Industrial Prices. New 
York: Columbia Univ. Press (for Nat. Bur. Econ. Res.), 1970. 

Vining, Daniel R., and Elwertowski, Thomas C. “The Relationship between 
Relative Prices and the General Price Level.” 4.E.R. 66 (September 1976): 
699-708. 








Exchange-Rate Dynamics: An Empirical 
Investigation 





Robert A. Driskill 


University of California, Davis 


This paper estimates a reduced-form exchange-rate equation whose 
estimates are used to address questions on exchange-rate over- 
shooting, intermediate-run exchange-rate dynamics, and long-run 
proportionality relationships between relative money supplies and 
exchange rates. Based on Swiss—U.S. data from the period 1973-79, 
the major findings are that following a monetary shock there is 
short-run exchange-rate overshooting by a factor of about two, and 
that subsequent exchange-rate adjustments to a new long-run equi- 
librium take longer than 2 years and exhibit nonmonotonic patterns. 


I. Introduction 


Much of the recent literature on floating exchange rates has viewed 
relative money supplies as the basic determinant of exchange rates 
and relative price levels. While this literature is in agreement about 
the long-run relationships between money, exchange rates, and price 
levels, it differs in its predictions about short- and intermediate-run 
exchange-rate determination.’ In particular, one strand of this liter- 
ature, mainly associated with the work of Dornbusch (1976), has 
emphasized the role of slowly adjusting commodity price levels for 
short- and intermediate-run exchange-rate dynamics. The most 
striking implication of the Dornbusch model is that the exchange rate 
may “overshoot” in the short run: In response to a change in relative 


This study stems from my doctoral dissertation. Grateful acknowledgments are due 
to my primary advisor, Jurg Niehans, and also to Bela Balassa, Jeffrey Frankel, Dale 
Henderson, George Kanatas, Ed Kane, Jerry Thursby, and an anonymous referee. 

1 For a discussion of these different approaches, see Bilson (1978). 
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money supplies, the exchange rate may immediately change by more 
than its long-run equilibrium value. Another implication of the model 
is that, following an initial overshoot, the exchange rate then mono- 
tonically approaches its long-run equilibrium value. Both of these 
implications result from the key assumptions of slowly adjusting 
commodity prices and perfect capital substitutability and mobility. 

Within the theoretical framework of slowly adjusting commodity 
prices, an alternative to Dornbusch’s view of exchange-rate dynamics 
has developed, an alternative which emphasizes stock/flow interac- 
tions and relative-price trade-balance effects. This literature, devel- 
oped by, among others, Branson (1976), Niehans (1977), and Hen- 
derson (1980), alters the strong Dornbusch conclusions about short- 
run overshooting and monotonic exchange-rate and _ price-level 
adjustments to long-run equilibrium, instead permitting short-run 
undershooting and nonmonotonic exchange-rate and _ price-level 
adjustments. 

The purpose of this paper is to examine empirically the predictions 
of these exchange-rate dynamic models against Swiss/U.S. data over 
the period 1973-77. Specifically, the paper addresses the questions of 
whether there is short-run exchange-rate overshooting, whether the 
intermediate-run exchange-rate and price-level adjustments are 
nonmonotonic, and whether the long-run proportionality relation- 
ships between money supplies, price levels, and exchange rates hold. 

This is not the first attempt to test a model of exchange-rate deter- 
mination based on the Dornbusch exchange-rate dynamics approach. 
Frankel (1979) extends the Dornbusch model by incorporating secu- 
lar inflation rates and tests it against the deutschmark/dollar rate, and 
Bilson (1978) provides suggestive evidence about the appropriate- 
ness of the Dornbusch approach for the deutschmark/pound rate. 
Neither test, however, uses an exchange-rate equation rich enough 
to incorporate trade-balance responses to relative price changes and 
thus exclude the possibility of more complex, perhaps oscillatory 
exchange-rate adjustment paths and exchange-rate undershooting. 

The major empirical findings from this study are as follows: (1) The 
exchange rate overshoots in the quarter in which a monetary change 
takes place by a factor of about two. (2) The exchange-rate adjustment 
path to full equilibrium is not monotonic, as predicted by the original 
Dornbusch fixed-output model, but rather exhibits periods of ap- 
preciation and depreciation. The price level, however, adjusts mono- 
tonically. (3) Purchasing power parity holds in the long run. As noted 
above, there is wide theoretical agreement that this should be the case, 
but it has often been difficult to verify empirically. Furthermore, the 
“long run” is calculated to be approximately 2-3 years. 
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II. The Reduced-Form Equations 


The goal of this section is to motivate the reduced-form exchange- 
rate and price-level equations which are estimated in the following 
section. Reduced-form equations are, of course, consistent with a 
variety of structural models. What is done here is to develop two 
structural models, the Dornbusch model and a stock/flow model de- 
veloped by generalizing the Dornbusch model to permit imperfect 
capital mobility. Both of these structural models impose a priori 
constraints on the reduced-form parameters and thus can in principle 
be rejected by the data. Of course, neither structural model may be 
sufficiently complex, and a “true” structural model may impose few 
theoretical predictions about a priori reduced-form constraints. This 
possibility emphasizes the fact that the reduced-form empirical esti- 
mates are more general than either structural model. 


A. The Dornbusch Model 


This subsection recasts the Dornbusch model in discrete time. The 
model consists of three basic building blocks: a money-market equilib- 
rium condition, a price-level adjustment equation, and an uncov- 
ered-interest-arbitrage specification. Assuming that both the domes- 
tic and foreign country have identical structural parameters, we can 
specify money demand as: 


me =P + OY; = AG Uz, (1) 


where m is the log of the ratio of the domestic to foreign money 
supply, p is the log of the ratio of the domestic to foreign price level, y 
is the log of the ratio of domestic to foreign real income, r is the 
difference between domestic and foreign interest rates, \ is the inter- 
est rate semielasticity of the demand for money in each country, ¢ is 
the income elasticity of the demand for money in each country, and v, 
is a serially uncorrelated random variable with zero mean and vari- 
ance a2. 

Each country’s money supply is assumed to be exogenously con- 
trolled by the respective government. Furthermore, all changes in the 
relative money supply are assumed to be unanticipated; this assump- 
tion is important because it implies that this period’s relative money 
supply is also the expected supply for all future periods. Assuming 
equilibrium obtains each period, we can write the equilibrium condi- 
tion as: 


m, = (m,)* = pp + OY; SAME UE (2) 
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In the goods market, relative demand for output depends on rela- 
tive real income, the relative interest rate, and relative prices and has 
the following functional form:? 


log Dy = yy — OT, + Or — Pt), (3) 


where ¢ is the log of the exchange rate and D is demand for domestic 
output. 

The rate of relative inflation, p,; — p;, is proportional to the log of 
the ratio of relative demand to relative supply in the goods market: 


Paavnpa= 5(log D; — 9;). (4) 


Combining equations (2), (3), and (4), we can write the relative price 
equation as: 


Pe = Gt-1 + A Pt-1 + A.M, + A3ey-1, (5) 


where: a, = 6(1 — y) + /A, a, = 1 — 80/A — 8a, ag = 50/X, and az = 
ba. 

The final building block of the Dornbusch model is a joint assump- 
tion of uncovered interest arbitrage and exchange-rate expectations. 
The uncovered interest arbitrage assumption can be stated as 


Cp =1Xe =e (6) 


where x, is the expected change ine fromt tot + 1. The assumption 
about x, is that it is a fraction of the gap between the current and 
long-run equilibrium exchange rate. Given the assumption that the 
relative money supply follows a random walk, this implies the follow- 
ing relationship: 


x, = O0(m, — e,) +k, O< 06 < 1,k a constant. (7) 


Combining equations (2), (5), (6), and (7), we can derive the following 
reduced-form equation: 


€p = Wot wept Wm, bears wy TWaPr—1 + Ws)¢ 
+ WeYe-1 + 172, (8) 


where z, is a first-order serially correlated random variable and where 
the 7;'s satisfy the following constraints:3 


* Output demand could be specified as a function of the real interest rate. In both the 
Dornbusch and the stock/flow models, this change makes no difference in the qualita- 
tive implications derived for the reduced-form equations. 

3 Details are found in Appendix A. 
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4 
> te= 1 


i=1 3 < 0 
Ty <0 TT 4 <0 
I> > 1 aise < 0 TT ¢ < 0. 


The constraint on the sum of the first four 7;’s simply says that 
purchasing power parity holds in the long run. The constraint that 
72 > 1 says that there must be short-run overshooting. 


B. The Stock/Flow Model 


As noted in the Introduction, the stock/flow model is developed by 
generalizing the Dornbusch model to permit imperfect capital mobil- 
ity. To do this, a net demand for foreign assets is specified as a linear 
function of the expected net yield: 


By = n(x — 72), n > 0. (9) 


In addition, a trade-balance equation is specified as a linear function 
of the log of relative prices and the log of relative real incomes: 


Ty, = ae; — fr) — Bye + ey, a,B=0, (10) 


where wu, is a zero-mean, finite-variance, serially uncorrelated random 
variable.* 

The foreign-exchange market-clearing equation states that net 
capital flows equal net trade flows plus all other autonomous flows 
(assumed constant): 


ABe= Tet, (11) 


where A, is a constant. Replacing equation (6) in the Dornbusch model 
by equation (11), we can derive the following reduced-form ex- 
change-rate equation: 


p= ty We + yi; + 
, ' / Poul: 
+ W4pi-1 + Ws5Ye + WeYi-1 + 77%. (12) 


The 7;’s satisfy the following constraints:° 


4 The term 7, could be specified as a function of current and lagged relative prices 
where the current-price effect is perverse, i.e., creates a J-curve effect. This specifica- 
tion is indistinguishable from (10) in terms of restrictions on reduced-form param- 
eters. 

> Details are in Appendix A. 
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7 = 1 


— 7; 20 
aT ek 14 > 0 
7, >0 Tw, = 0 1,6 = 0. 


While the purchasing power parity constraint holds just as with the 
Dornbusch structural model, the other constraints are quite different. 
In particular, both the lagged exchange-rate and lagged price-level 
coefficients may be positive in this model. The estimated sign of these 
coefficients should provide a sharp test between these two models. 
Also note that 7; need not be greater than one; this model makes the 
question of overshooting or undershooting an empirical question. 


III. Estimation Results 


The reduced-form equations (5), (8), and (12) were estimated for 
Swiss/U.S. data with two changes. First, after some experimentation 
with various proxies for Swiss income (data on which are available 
only on a yearly basis), the income variables y, and y,_, were dropped. 
None of the proxies used in the experimentation provided significant 
coefficients, and the remaining coefficients were hardly affected when 
income was dropped. Second, two dummy variables were added to 
the exchange-rate equation: OIL, which takes the value of 1 for 
December-January-February 1973-74 and zero for all other periods, 
reflecting the improved attractiveness of dollar-denominated assets 
following the announcement of the oil embargo and cuts in Arab oil 
production; and SEAS, which takes the value of 1 every December- 
January-February and zero all other periods, capturing the pro- 
nounced year-end demand for Swiss francs by Swiss firms for end- 
of-year “window dressing” of their financial statements.® 

The choice of the Swiss franc/U.S. dollar rate for the period 
1973-77 is based on the relative “cleanliness” of the franc float, the 
substantial variation in the franc-dollar rate, and the independence 
over this period of the franc and dollar from the “snake.” 

The estimation uses quarterly average data, where the quarterly 
data were generated by averaging three monthly end-of-period 
figures; that is, March-April-May is the average of March, April, and 
May. This provides a sample size of 19 observations, beginning March 


® Schiltknecht (1976) discusses this point. 
7 Schiltknecht (1976) indicates that the Swiss have no exchange-rate targets for the 
Swiss money supply at least for periods longer than a month or two. 
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1973 and ending November 1977.8 Even though monthly data would 
have had the advantage of more observations, the quarterly speci- 
fication was used since it is more consistent with the theoretical 
models developed, that is, money-market equilibrium and trade-bal- 
ance adjustment. While it would be possible to specify partial 
asset-adjustment schemes and longer trade-balance lags and use 
the monthly data, this would introduce substantial multicollinearity 
problems, precluding precise parameter estimates. 

Sources and methods of deriving the data are explained in Appen- 
dix B. The price index used was the consumer price index, and the 
monetary aggregate used was M3. Not surprisingly, the results were 
sensitive to the choice of monetary aggregate: Over the period in 
question, U.S. MI demand has not been very stable in comparison 
with M3 demand.? In the presentation of estimated equations, t-statis- 
tics are in parentheses beneath the estimated coefficient, and Durbin’s 
H statistic is presented.!° All estimations were done using White’s 
(1978) SHAZAM regression package. 

The estimated relative price equation is: 


Pt = .60 Ste 6p a meh Pee - (05 care 
(1.90) (4.58) (1.96) C73) 


R? (adj.) = .96, Durbin’s H = 1.67. 


(13) 


Hence, a, = .76, a, = .37, anda; = —.03. Botha, and a, are significant 
at the .05 level for a one-tailed test, and the hypothesis that a, + a, + 
a3; = 1 is not rejected at the .05 level. Since a; in (13) is insignificant 
and the wrong sign, the equation was also estimated with only p,_, and 
m,-, as regressors.'! The estimated equation is: 


Pp = DT ATED +291; 
(1.86) (4.74) ' (1.89) 


R? (adj.) = .96, Durbin’s H = 1.65. 


(14) 


Again, botha, anda, are significant, and the hypothesis thata, + a, = 
1 is not rejected at the .05 level. Finally, the equation was estimated 


® A regression using end-of-period quarterly data from 1973:II through 1977:III is 
also reported. More will be said about the choice between average and end-of-period at 
that point. 

® See Goldfeld (1976) for evidence on this point. 

1 Tn regressions with a lagged dependent variable, Durbin’s H statistic is the appro- 
priate statistic to test serial correlation of the disturbance terms. For a sample as small as 
used here, however, the usefulness is diminished. 

11 See Appendix C for a derivation of the conditions under which a, ~ 0, but the 
trade balance is still a function of relative prices. Basically it requires that the trade 
balance be a very small fraction of total aggregate demand. 
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with the constraint that a, + a, = | imposed: 


pe = 57 + 71 py, + 29-1. (15) 
(1.84) (4.47) (1.86) 


The estimates of the relative price equations, then, are consistent with 
the model and have good explanatory power.” 

Preliminary to testing the exchange-rate equation, it is useful to see 
whether the assumption used in both structural models that the rela- 
tive money supply follows a random walk, that is, that m, = m;_, + x,, 
where x, is a serially uncorrelated random variable with zero mean, 1s 
reasonably consistent with the sample data. To do this, two proce- 
dures were used. First, m, was regressed against a constant term plus 
the previous lagged values of m,. A representative result is: 


m,= .15 + .99m,_, — .54m,_, — .08m;-3 + ..77 m4 — 29m, + 25 Ula: 
(eile 2535) Meee OO). (tal) (1.50) (=.7.0) (2.9) 
(16) 


Only the coefficient on m;_, is significant, and it is not significantly 
different from one. The second procedure used a test suggested by 
Box and Jenkins (1970, p. 291) for randomness in a residual vector 
from a time-series process. For this test, m, was regressed on a con- 
stant and on m,_, with the following result: 


mp0) PERO may 


(.09)(13.9) (17) 


Residuals from this equation and from an equation in which the 
constant term and m,_, coefficient were constrained at (0, 1) were then 
used in the aforementioned test for nonrandomness. Nonrandom- 
ness was rejected for both sets of residuals at the 10 percent level. 
Consequently, the assumption that all changes in the relative money 
supply are unanticipated seems justified. 

The estimate of the exchange-rate equation 1s: 


2 The relative price equation can be disaggregated by country and written in the 
OlMoxmianes siosmame FE hy ae Gh Se aly ce Ghee, ae Gi N Se Gh). Ibi tne 
structural equations of Switzerland and the United States are indeed identical, the d;’s 
should satisfy the following restrictions: d, = 1, d, = —d,d, = —d;. This equation was 
estimated, and the above hypothesis about the d,’s was not rejected at the .05 level; the 
standard errors, however, were quite large. 

'8 As with the price equation, the exchange-rate equation was also estimated with 
country-disaggregated data to test the assumption of identical structural equations 


across countries. The equation is: e, = hy + h,ym$¥ + hym¥S + hgm$¥ + hym¥S, + 
hope + hepeS, +hze,-,. The coefficient restrictions consistent with identical struc- 
tural parameters across countries are: h, = —hy, hs = —h4,h, = —hg. The hypothesis test 


of these restrictions was not rejected at the .05 level; the assumption of identical 
structural parameters cannot be rejected. Furthermore, the point estimates were rather 
similar to those of eq. (18) and satisfied all the other a priori coefficient restric- 
tions. 
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Chto cent 4 oe, 28233 Ionia QAO match 93 pe, 
(—2.82) (3.65) (5.73) (5.60) (2:23) 


+ .150IL — .06SEAS; RHO = .35, 
(7.61) (5.47) (1537) 


R? (adj.) = .99, Durbin’s H = .21, 


(18) 


where RHO is the estimated first-order autoregressive parameter of 
the error term." 

The overall explanatory power of the equation is quite good. Fur- 
thermore, all coefficients except RHO are significantly different from 
zero at the .05 level for a one-tailed test. The Durbin H statistic 
indicates no serial correlation in the disturbances. 

As far as a priori constraints on the parameters are concerned, first 
note that the sum of 7, through zy, is 1.28, which is insignificantly 
different from one at the .05 level. That is, the estimated equation is 
consistent with the purchasing power parity proportionality predic- 
tion about changes in relative money supplies and changes in long- 
run equilibrium exchange rates. To further test this result, equation 
(17) was also estimated with this constraint imposed: 


Cpe 288 + bee, 230 mp 2:69 mL, 284 p.2 


(—2.89) (4.48) (4.92) (5.79) (197) 19) 
( 
+ .16O0IL — .06SEAS; RHO = .21. 
ie25) (= 51) (.79) 


These coefficients are quite close to the unconstrained ones, thus 
providing further evidence that the purchasing power parity princi- 
ple holds. 

Both structural models developed in Section II imply that the 
purchasing power parity constraint should hold. The two models 
differed sharply, though, on their predictions of the sign of the 
lagged exchange-rate and lagged price-level coefficients. The estima- 
tion results of positive coefficients on these variables are consistent 
with the stock/flow model but inconsistent with the Dornbusch one. 
Thus, the data reject the Dornbusch model. Note, though, that the 
Dornbusch model’s major insight about overshooting is verified as an 
empirical phenomenon. 

Two more procedures were carried out to examine further the 
robustness of the preceding empirical results. First, the exchange-rate 
equation was estimated using end-of-period rather than average data. 

14 Estimation of (18) was done using a modified Cochrane-Orcutt procedure in the 


SHAZAM computer program developed by Kenneth White. Grid search and maxi- 
mum likelihood nonlinear methods produced almost identical estimates. 
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While using end-of-period data increases the likelihood that any one 
data point is an outlier, it has the advantage of not possibly confusing 
anticipated and unanticipated changes in the money supply. The 
results are: 


€, = —3.16 + .56e,_, + 1.69m, — 2.47m,_, + 1.17p,-1 
(—2.66) (3.44) (2.44) (—3.50) (2.14) 


+ 210IL — .07SEAS; RHO = —.14, 
(5.63)  (—3.44) (—.50) 


R? (adj.) = .97, Durbin’s H = —.34. 


(20) 


The estimated coefficients are significant at the .05 level, have the 
same signs as those of equation (18), have similar magnitudes, and 
satisfy the purchasing power parity constraint; indeed, ¥j-, 7; = .94, 
which is not significantly different from one at the .05 level. 

Second, an in-period dynamic simulation of the exchange-rate 
equation was performed. This is important because of the presence of 
a lagged dependent variable and tends to show whether the estimated 
regression simply picks up the serial correlation in the dependent 
variable. The forecast error and squared forecast error were then 
regressed against time to see whether there is a tendency for it to 
increase with time. The results are as follows: 


Forecast error = .05 — .0002(time), 
(3.83)(—.14) (21) 


Forecast error = .003 + .00003(time)?. 
(2.15) (.22) (22) 


This provides rather strong support for the empirical results. 


IV. Concluding Comments 


This paper has addressed those issues in exchange-rate determina- 
tion raised by the recent work on exchange-rate dynamics. The 
exchange-rate equation developed was flexible enough to be consis- 
tent with, among others, the structural model of Dornbusch (1976) 
and a stock/flow model incorporating features emphasized by Nie- 
hans (1977), Henderson (1979), and others. Both structural mod- 
els implied that a purchasing power parity a priori constraint on the 
reduced-form parameters should be satisfied; the estimates did not 
reject this hypothesis. Other a priori constraints implied by the Dorn- 
busch model were rejected, though. There was, however, empirical 
verification of Dornbusch’s overshooting hypothesis. 
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To help summarize the empirical findings on these questions, esti- 
mated coefficients from equations (15) and (18) are used to calculate 
an exchange-rate and price-level adjustment path following a one- 
time monetary increase. To do this, equations (15) and (18) are 
transformed into final form (suppressing all random disturbance 
terms): 


€é=Cyt > imei; Cy, a constant, (23) 


i=0 


P,=Cj+ > emi, — Ciaconstant, (24) 


i=1 


where the y,’s and e¢;’s are as follows: 


Mi = Te; 

He = (713 + 1712), 

Ms = (13 + 1,712), + a274, 

M4 = (13 + 17712) T} + dg74(77, + a;), 


Ms = (13 + 1y72)1} + dgtr4 (a? + a,7, + a3), 


[on = Age Bytes te Oot + ed, + et at +... + an 
figs), 

€; = 2, 

Eo = 424), 

€3 — 420}, 


= n-1 
En >i G,Qa4 . 


Using these final form coefficients and assuming the initial exchange 
rate and relative price level are (e = 0, p = 0), then, following a 
one-unit unanticipated monetary increase, we can calculate the ad- 
justment path as follows: 
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Gy ED Po = 0 €y —po = 2.30 
eé, = .88 fp, = .29 Ci hat Pim ee 
€ég = .34 p2 = .50 Cae= Poe = ae 
és = .21 ps = .65 Cp. = ae 
é, = .27 pa = 15 64 — Ps = 48 
e, = .39 p; = .82 Cy =p, — 43 
é, = .48 pe = 87 Cau fg = 209 
e; = .54 pf; = 91 Cae Py eae 
ég = .61 ps = .94 és — pe = 39 
ég = .67 po = -96 CouniP sie eo 
Cie = 13 Pro = -97 ee Lie oe 


The exchange rate initially depreciates to 2.30, then appreciates for 
three quarters to 0.21, and then depreciates again. After 11 quarters, 
about three-quarters of the exchange-rate adjustment has taken 
place, and 97 percent of the price adjustment has occurred. Notice 
that deviations from purchasing power parity are positive for only the 
first two quarters, after that remaining negative. This means that 
exports are stimulated and imports suppressed because of increased 
relative prices for only one quarter, and then the effects are re- 
versed.'!® Note also that the exchange-rate path to long-run equilib- 
rium is, as anticipated, nonmonotonic, showing periods of both ap- 
preciation and depreciation. 

To keep these empirical results in perspective, some of the more 
serious limitations should be noted. First, the exchange-rate equation 
tested was based on a bilateral model, while in reality the n — 1 
independent exchange rates of an n-country world are functions of 
exogenous variables of all n countries. The equations estimated here, 
then, are subject to omitted-variables specification bias. If both the 
correlation of these omitted variables with the included variables and 
the expected sign of the coefficient of these omitted variables can be 
determined, then the bias in the estimated coefficients can be deter- 
mined. At present, not even the expected sign of the omitted 
coefficients has been determined.'* 

Second, the equation estimated is a reduced form from whose 


15 Note that the cumulative trade balance over the adjustment path must be zero, 
since there is no net accumulation of foreign assets. 


'6 See Berner et al. (1975) for an attempt to develop an n-country model. 
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estimates structural parameter values cannot be retrieved. While the 
reduced-form estimates seem plausible, they would be more credible 
if they were shown to be consistent with existing estimates of an 
underlying structural model’s parameters. Still, the results provide 
general support for the monetary approach to exchange-rate deter- 
mination and specific support for the importance of both slow 
commodity-price adjustments and current-account relative-price ef- 
fects within that approach.!7 


Appendix A 


I. Derivation of Reduced-Form Constraints from the Dornbusch Model 


Combining equation (7) with (6) and (2), we get: 
€; = m,(1 + 1/0A) + p,(— 1/0) + ¥4(—/Od). (Al) 
Substituting (5) for p, we have: 
lr = &-1(—43/0A) + m,(1 + 1/0) + m,_,(—a2/0X) + py-,(—a,/0X) (A) 
+ y:(—/Od) + y-_,(—ao/OX). 


II. Derivation of Reduced-Form Constraints from the Stock/Flow Model 


Replacing equation (6) of the Dornbusch model with equation (11) we get the 
following difference equation: 


e, =e,.,| 28+ ala —n/r) |. ,, Se 
: oa noO+a ( no+a 


oS —70 — n/rd — ayn/d + aoa )+ Pes] (1 = a,)n/r\ + aa, | (A3) 
noO+a noO+a 


+ Ye SS atop fe ny) : 

no+a) nO+a 
Note that the coefficient on m, may be less than one, the coefficient of e, may 
be positive, and the coefficient on p,, must be positive. 


Appendix B 


I. Switzerland 


1. Exchange rate. —End-of-period data are end-of-month single observations. 
Source: OECD Main Economic Indicators, Paris, various issues. Quarterly 
data are averages of daily noon New York rates. Source: Federal Reserve 
Bulletin, Washington, various issues. 

2. Money supply.—The M3 is taken from International Financial Statistics 
of the IMF. It appears to include some demand deposits denominated in 


17 Again, see Bilson (1978) for a discussion of how the Dornbusch approach fits 
within the general monetary framework. 
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foreign currencies. From figures supplied by. Peter Buomberger of the Swiss 

National Bank, it appears that this component is a small percentage of total 

M3 (less than 4 percent) and fairly stable. These figures were only for 22 

nonconsecutive months, which precluded using them for estimation pur- 
oses. 

3. Consumer price index.—Source: OECD Main Economic Indicators. These 

are monthly end-of-period figures. 


Il. United States 


1. Money supply. —Source: Federal Reserve Bulletin. Seasonally unadjusted data 


were used for the quarterly estimations. 
2. Consumer price index.—-Source: OECD Main Economic Indicators. 


Appendix C 


This Appendix shows that a; in the price equation may be close to zero and the 
trade balance still a function of relative prices if the trade balance is a 
relatively small fraction of aggregate demand for a country’s output. Let D be 
the ratio of domestic to foreign aggregate demand, E and E* domestic and 
foreign expenditure, respectively, that is, C + J + G, and P relative prices 
between the two countries’ outputs. The question is what implicit assumption 
makes (d log D)/(d log P) close to zero. Now, 





E+T 
D ot 
i. (Cl) 
where T is the trade balance. Hence, 
dlogD _ 1 Eat) ae a(S) 
d log P (Ei) d loge (2p dain (C2) 
Stet Pode GEV) seh yD (EA KT) 
(E +T) dP (Et aT) dP . 


Now define the trade balance elasticity as PT’ /T = e. Assuming that dE /dP = 
dE*/dP = 0 and that E = E*, we have 


d log D - De 
dlogP (E/T) —(T/E) 


Hence, (d log D)/(d log P ) goes to zero as E/T goes to infinity, despite the size 
of e. Note that this does not say that trade balance responsiveness to relative 
prices does not affect exchange rates; it only does not affect aggregate 
demand. What is more important for exchange-rate determination are the 
relative sizes of trade flows and capital flows. 


(C3) 
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Families can self-insure against uncertain dates of death through 
implicit or explicit agreements with respect to consumption and 
interfamily transfers. Interfamily transfers need have nothing to do 
with altruistic feelings; they may simply reflect risk-sharing behavior 
of completely selfish family members. Although family annuity mar- 
kets are incomplete, even small families can substitute by more than 
70 percent for perfect market annuities. Given adverse selection and 
transaction costs, family risk pooling may be preferred to public 
market annuities. In the absence of public annuities, these risk- 
sharing arrangements provide powerful incentives for marriage and 
family formation. 


The institution of the family provides individuals with risk-sharing 
opportunities which may not otherwise be available. Within the family 
there is a degree of trust and a level of information which alleviates 
three key problems in the provision of insurance by markets open to 
the general public, namely, moral hazard, adverse selection, and 
deception. In addition, provision of insurance within the family may 
entail smaller transaction costs than arise in the purchase of insurance 
on the open market. There are a number of important risks for which 
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the “public” market problems of moral hazard, adverse selection, and 
deception are especially severe. The risk of loss of job or earnings 
because of changes in the pattern of demand or partial disability is 
one example. Here the ability of the public market to determine the 
extent to which the individual actually suffered an earnings loss or is 
simply lying about his backache is highly questionable. Other exam- 
ples are the risk of bankruptcy and the default risk on personal loans. 
Many family practices in dealing with these types of risks can be 
explained as implicit insurance contracts made ex ante by completely 
selfish family members. Love and affection may be important for the 
enforcement of some of these implicit contracts, but they need not be 
their sole or even chief determinant. Healthy brother A’s support for 
disabled brother B may simply be the quid pro quo for brother B’s 
past implicit promise to support A if A became disabled instead of B. 

The existing economics literature on marriage and the family (in- 
cluding Schultz [1974] and Becker, Landes, and Michael [1977]) has 
not, to our knowledge, explicitly considered the family’s role in 
providing insurance to family members. 

This paper is concerned with family provision of insurance against 
the risk of running out of consumption resources because of greater 
than average longevity. The problem is how fast to consume over time 
when one does not know how long one will continue to live. Too much 
consumption when young may mean relative poverty later on if one 
lives “too long”; alternatively, excessive frugality when young involves 
the risk of dying without ever having satisfied one’s hunger. A com- 
plete annuity market permits an individual to hedge this uncertainty 
of the date of death by exchanging his initial resources for a stream of 
payments that continue as long as the individual survives. We demon- 
strate here that implicit risk-sharing arrangements within marriage 
and the family can substitute to a large extent for the purchase of 
annuities in public markets. Since the number of family members 
involved in the risk pooling is generally small, these family risk- 
sharing arrangements constitute an incomplete annuities market. 
However, our findings suggest that even small families can substitute 
by more than 70 percent for a complete annuity market in pooling the 
risk of death. When the economic structure of society is sufficiently 
developed to sustain organized public insurance markets, implicit risk 
pooling within an incomplete family annuity market may well be 
preferred to public purchase of annuities because of adverse selection 
and transaction costs.! When organized insurance markets do not 


1 The transaction costs we have in mind here include the time costs involved in 
negotiating individual specific annuity contracts. As we demonstrate in the text, each 
individual’s optimal annuity contract depends on his rate of time preference, his degree 
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exist, the analysis here indicates that implicit risk-sharing arrange- 
ments can provide powerful economic incentives for marriage and 
family formation. 

Throughout the paper individuals are assumed to be completely 
selfish; that is, they obtain utility only from their own consumption. 
One implication of this approach is that voluntary transfers from 
children to parents or bequests and gifts from parents to children 
need have nothing at all to do with altruistic feelings; rather, they may 
simply reflect risk-sharing behavior of completely selfish individuals. 
While altruism per se is not required, some level of mutual trust and 
honesty 7s required since elements of these arrangements are not 
legally enforceable. 

This paper is divided into four sections, the first of which describes 
optimal consumption behavior for a single individual in both the 
presence and absence of a complete annuity market. The welfare 
gains from access to a complete and fair annuity market are calculated 
for the case of the iso-elastic utility function. This welfare gain is, in 
turn, decomposed into income and substitution effects. This decom- 
position suggests that an important component of the gains from 
access to complete or incomplete annuity markets is the desirability of 
substituting future for current consumption. 

Section II develops the theoretical argument for Pareto-efficient 
implicit family annuity contracts and explores potential welfare gains 
arising from these arrangements.” Although the complexity of the 
calculations precluded analysis of large families, quantitative results 
for families of two and three persons are presented. The analysis 
considers cases in which family members both do and do not have 
identical survival probabilities (i.e., are of similar and dissimilar ages 
and sexes). This framework permits us to ask whether marriage 
between individuals with similar survival probabilities is more efficient 
than marriage between individuals with dissimilar survival prob- 
abilities. 

Optimal family annuity contracts involve agreements on the con- 
sumption path of each family member as well as a commitment on the 
part of each member to name the other members as sole heirs in his 
estate. Section III discusses the problems of enforcing both aspects of 
these agreements. Section IV summarizes the paper and suggests 
areas for future research. 


of risk aversion, and his survival probabilities. Some individuals may prefer a constant 
annuity stream, others an increasing or decreasing stream of annuity payments. 

* Kotlikoff and Spivak (1979) present a proof that family annuity contracting con- 
verges to a complete annuities market as the number of family members increases. 
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I. A Single Person’s Consumption Plans with 
and without Fair Annuities? 


In the absence of an annuity market, a single individual’s consump- 
tion choice problem is to maximize his expected utility, equation (1), 
from current and future consumption subject to the budget con- 
straint, equation (2): 


D 
EU =) P,U(C,), (1) 
t=0 
D 
> eRe =e (2) 
t=0 


The P,’s of equation (1) are probabilities of surviving from age zero 
through age t; Py equals one. The term D is the maximum longevity. 
For simplicity, we assume the utility function is separable in con- 
sumption (C,) over time. In (2) R, the discount factor, is one plus the 
interest rate. The initial wealth of the individual is Wy; we ignore 
possible streams of future labor earnings or inheritances.’ 

The budget constraint written in equation (2) is identical to the 
budget constraint that would arise in a certainty world in which 
individuals never died before age D. While individuals will, on the 
average, die prior to age D, equation (2) reflects the nonzero prob- 
ability that an individual will live through age D; that is, equation (2) is 
the relevant budget constraint because the individual may actually live 
through age D, in which case his realized present value of consump- 
tion cannot exceed his budget. 

Let us now assume that the single person is free to purchase ac- 
tuarially fair annuities in a complete public annuities market. The 
budget constraint in this case is 


D 
mi Feglint =v (3) 


t=0 


3 Yaari (1965) is the pioneering paper on this subject. Sheshinski and Weiss (1981) 
provide an illuminating discussion on the interaction of annuities and social insurance. 
Barro and Friedman (1977) provide an analysis of the risks of the uncertainty of the 
date of death. 

4 The gains from access to an annuities market are greatest when the individual has 
all his resources up front. This assumption, then, dramatizes the demand for annuities, 
but dropping this assumption would not alter the theoretical point that families can 
substitute for annuity markets. For the sake’ of completeness one can think of the 
individuals described in this paper as having received all their resource streams prior to 
their current age. In the no-annuity, no-family world involuntary bequests can be 
thought of as being collected by the government and redistributed to individuals at 
their birth. 
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In contrast with (2), (3) requires only an equality between the ex- 
pected present value of consumption and initial wealth. The single 
individual now chooses his optimal consumption path by maximizing 
(1) subject to (3); he then exchanges his initial wealth W, with the 
insurance company in return for its promise to pay out the C;, stream 
as long as the person continues to live. 

The P,;R~’s in (3) may be thought of as prices. Since each of the P,’s 
in (3), except Py) which equals unity, is less than one, the consumption 
choice in the case of a fair annuity market is equivalent to the con- 
sumption choice without an annuity market but with lower prices of 
future consumption. Obviously, access to a fair annuity market in- 
creases utility by expanding the budget frontier; it also alters the 
optimal consumption path because of the income and substitution 
effects resulting from the lower prices of future consumption. 

The iso-elastic utility function (4) is convenient for assessing the 
potential gains from access to a fair public annuities market as well as 
the gains from family annuity arrangements: 

D 
Fl ec (4) 
t=0 ay 
In (4), y is the constant relative risk-aversion parameter, and a is the 
time preference parameter. By considering different values of y we 
indicate for this family of utility functions how the gains from an- 
nuities and family arrangements depend on the specification of tastes. 

In the no-annuities case maximization of (4) subject to (2) leads to 

the consumption plan, (5): 


C oe W (Ra)'P}” 
tsar D 5 = . 
JA-YWIY ,JIY PAY 
Se ult a P} 


(5) 


In the case of fair annuities, maximizing (4) subject to (3) leads to 


C, 5) W,(R a)” 


iz 3s RiG—VI7 il P (6) 


Figure 1 compares equations (5) and (6) for the case R = a = 1. The 
ability to trade in a fair annuities market may raise or lower initial 
consumption, depending on whether y is less than or greater than 
unity (fig. 1). Intuitively, the higher the degree of risk aversion, y, the 
greater the concern for running out of money because of excessive 
longevity and, hence, the lower the initial consumption. At y equal to 
infinity, equation (5) dictates equal consumption in each period. 

Plugging (5) or (6) into (4), we arrive at two indirect utility functions 
for the no-annuity and annuity cases with initial wealth, the interest 
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Consumption 









Fair Annuities (y=1,y>1, y<1) 


No Annuities (y>1) 


No Annuities (y=1) 
No Annuities (y<1) 


Time 


Fic. 1.—Consumption paths with and without fair annuities 


rate, and survival probabilities as arguments. These functions are 
presented in equations (7) and (8), respectively, 
l Fees 2 Y 
H,(W,) = eas wer y piel : (7) 


ay 5=0 


1 . i 
V(W,) = i= wi? > mne7P, : (8) 


j=0 

The increase in utility resulting from access to fair annuities can be 
measured in terms of dollars. Equation (9) solves for the value of M, 
which represents the percentage increment in a single person’s initial 
wealth required, in the absence of an annuity market, to leave him as 
well off as he would be with no additional wealth but with access to an 
annuities market: 


H,(MW,) = V§(W,). (9) 


For the iso-elastic utility function this calculation is independent of 
the initial level of wealth. Table 1 reports values of M for different 
ages and levels of risk aversion using both male and female survival 
probabilities. Friend and Blume (1975) estimate the degree of relative 
risk aversion from individual portfolio choices. They conclude that 
risk aversion, on average, exceeds unity. We present our results for 
risk-aversion coefficients of 0.75, 1.25, and 1.75, a range that we feel 
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TABLE 1 


PERCENTAGE INCREASE IN INITIAL WEALTH REQUIRED TO OBTAIN 
Farr ANNuITIES UTiLiry LEVEL 











Relative 
Age Risk Aversion (y) Males Females 
30 Ths) 24.5 18.5 
55 75 46.9 34.4 
75 75 72 63.0 
90 75 99.8 100.2 
30 1.25 30.3 PAT 
55 1.25 59.2 43.4 
75 1.25 97.0 85.3 
90 1.25 152.6 152.9 
30 1.75 34.7 26.1 
55 1.75 68.9 50.7 
75 7D 119.1 104.6 
90 1.75 199.1 199.4 





Note.—Throughout table a = .99 and R = 1.01. 


encompasses reality. The survival probabilities used in this and all 
subsequent calculations are actuarial estimates from the Social Secu- 
rity Administration.’ Maximum longevity is taken to be 120 through- 
out the paper. 

Table 1 indicates that the utility gain measured in dollars from 
access to an annuities market can be quite large. For a relative risk- 
aversion parameter value of 0.75, the gain to a 55-year-old male is 
equivalent to a 46.90 percent increase in his initial wealth. The utility 
gain is age dependent; for y = 0.75, the 30-year-old male’s gain is 
24.46 percent, while the 90-year-old male’s gain is 99.81 percent. 
Annuities are less important to young people because a large fraction 
of their lifetime utility from consumption is fairly certain due to their 
lower mortality probabilities in the immediate future. Higher levels of 
risk aversion naturally increase the gains from access to an annuities 
market. The male-female differences in the table reflect the higher 
male age-specific mortality rates. The calculation is somewhat sensi- 
tive to the choice of a and R. Raising the interest rate to 5 percent 
while holding a@ constant increases the age 55 wealth-equivalent factor 
from 46.90 to 55.57 for the case of y = 0.75. The 90-year-old wealth 
equivalent is increased from 99.81 to 115.34. 


> We use the low mortality male and female probabilities reported on pp. 17 and 19 
of the Social Security Administration Actuarial Study no. 62 (see U.S. Department of 
Health, Education, and Welfare 1966). 
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Income, Substitution Effects, and Unintended Bequests 


Without access to an annuity market a single, nonaltruistic individual 
will always die prior to consuming all his wealth and, accordingly, will 
make involuntary bequests. The level of these unintended bequests 
can be quite large. From equation (5) we calculated the consumption 
path as well as the corresponding wealth path for the no-annuity case. 
By multiplying the probability of dying at each age times the wealth at 
each age and discounting back to the initial age, the present expected 
value of these unintended bequests can be computed. For y = 0.75,R 
= 1.01, and a = .99, the present expected value of unintended 
bequests represents 24.47 percent of initial wealth for a single male 
aged 55. This number means that a 55-year-old male with no annuity 
market will, on average, fail to consume about one-quarter of his 
wealth because he is risk averse. Increasing the risk-aversion 
coefficient to 1.75 raises the ratio of present unintended bequests to 
initial wealth to 0.3583. These large unintended bequests occur de- 
spite a fairly rapid rate of consumption. Current mortality prob- 
abilities dictate a fairly rapid rate of consumption even for high levels 
of risk aversion. For y = 1.75, a single male who survives to age 85 
consumes at age 85 less than a third of his age 55 consumption level.® 
The homothetic property of the iso-elastic utility function permits a 
decomposition of the utility gains from fair annuities into income and 
substitution effects. Suppose a fair insurance company approached a 
single, 55-year-old (y = 0.75) male and offered to pay him 24.47 
percent of his initial wealth in exchange for his naming the insurance 
company as his heir. The single male would take the 24.47 percent 
gain and, because of homotheticity, consume it according to his origi- 
nal no-annuity consumption path. This additional wealth would give 
rise to an additional .2447 x .2447 in present expected bequests. By 
letting the insurance company also pay for this second round of 
expected but involuntary bequests as well as further rounds, the 
insurance company ends up paying 32.40 = .2447/(1 — .2447) percent 
of the single individual's initial wealth. This 32.40 percent figure 
represents the utility gain from the pure income effect. In this 
scenario the individual continues to consume at the no-annuity set of 
prices. Since the total gain from being able to purchase fair annuities 
and thus face lower prices for future consumption is 46.90 percent, 
the income effect represents 69.08 percent and the substitution effect 
30.92 percent of the total gain. Hence, the ability to alter the age 


6 The ratio of consumption at age 75 to consumption at age 55 is 0.62. When risk 
aversion equals 0.75, the ratio of consumption at age 85 to consumption at age 55 
equals 0.06; it is 0.33 at age 75. 
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consumption profile is an important part of the total welfare gain 
from annuities. 


II. The Family as an Incomplete Annuities Market 


Decisions by family members concerning consumption expenditures 
and interfamily transfers may reflect implicit though incomplete an- 
nuity contracts. In the case of marriage both individuals commonly 
agree to pool their resources while both marriage partners are alive 
and to name each other as the major, if not the sole, beneficiary in 
their wills. For each partner the risk of living too long is somewhat 
hedged by the other partner’s potential death; if one partner lives to 
be very old, there is a high probability that his (or her) spouse has 
already died leaving him a bequest to help finance his consumption. 
While each spouse gains simply from the exchange of wills, the two 
can further increase their expected utilities by agreeing on a joint 
consumption path that takes into account each spouse’s expected 
bequest to the other. The importance of joint consumption planning 
is highlighted in the case of an implicit contract between a parent and 
a child. Here the parent implicitly promises to name the child in his 
will in exchange for the child’s implicit promise to care for the parent 
if the parent lives too long. Although the child may have zero prob- 
ability of dying while the parent is still alive, both can gain because the 
child agrees to share consumption resources with the parent. 

This view of bequest and consumption arrangements within mar- 
riage as an incomplete annuity market becomes intuitive when one 
contemplates increasing the number of members in the family. To 
simplify the issue, let us assume that all individuals within the family 
have identical survival probabilities and that they enter this multiper- 
son family with identical resources. In the limit as the family (or 
“tribe”) gets large, the consumption path of an individual within the 
tribe converges to the path a single individual would choose in a 
complete and actuarially fair annuities market (Kotlikoff and Spivak 
1979). 


Quantitative Analysis of Family Risk Pooling 


In the case of two family members the frontier of efficient marriage 
contracts is obtained as the solution to the following recursive 
dynamic programming problem: 


V,-,(W,-,) = max [u*(C#,) + dus(Cz,) + OP 4-1 Qut—-1V (W) 
WAGES Gi = 00= Dae 1 (10) 
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where 
V>(W,) = max u#(C#) + 6uS(C$). 
CHC} 

In (10) V,(W,) is the period t maximum-weighted expected utility of 
the two family members with joint wealth W,. In the expression the 
letters H and S denote the two family members, C# and C§ are the 
consumptions of the two, uw” and wu are their utility functions, Py,_, 
and Q,,_,; are their respective period ¢ survival probabilities condi- 
tional upon surviving through period ¢ — 1, and H,(W,) and S,(W,) 
are the maximum expected utilities for each member if he or she 
alone survives to period ¢. These expressions are obtained from equa- 
tion (7) by replacing W, with W, and applying the appropriate prob- 
abilities. The term @ is the differential weight applied to member S’s 
expected utility. 

The first two terms on the right-hand side of (10) represent utility 
from certain period t — 1 consumption. The third term is the family’s 
expected period ¢ utility multiplied by the probability that both mem- 
bers survive to period t. The last two terms represent expected utilities 
when one member dies and the other survives. 

The Appendix presents an algorithm to solve (10). The algorithm 
for solving the three-family-member maximization problem is avail- 
able from the authors.’ 


The Gains from Family Annuity Contracts 


The solution to (10) permits a comparison of consumption paths and 
utility levels of married people with those of single persons, assuming 
throughout that there is no public annuities market. Both spouses are 
assumed to have identical iso-elastic utility functions in the sense of 
the same degrees of risk aversion and rates of time preference. 
The shape of consumption paths for married couples while they are 
both alive may differ from that of single individuals for two reasons. 
First, even if the two spouses have identical survival probabilities, the 
reduction in risk within the marriage rate acts like a reduction in the 
price of future consumption. If the relative risk-aversion parameter y 
exceeds (is less than) one, the identical survival probabilities marriage 
profile will start above (below) the single person’s profile. For y equal 


7 In the three-member family we maximize a weighted sum of the three members’ 
expected utility taking all survival contingencies into account. If one of the three dies 
first, the other two jointly inherit the remaining wealth and consume according to the 
optimal two-person plan. 
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to unity the profiles are identical. In terms of figure 1, the consump- 
tion profiles for married persons lie between the no-annuity and 
complete annuity profiles. The second reason for different consump- 
tion profiles for married people relative to single individuals is possi- 
ble differences in spousal survival probabilities. Higher survival prob- 
abilities act like lower rates of time preferences. When an old man 
marries a young woman the slope of the optimal marriage consump- 
tion profile reflects the survival probabilities of both the old husband 
and the young wife. The two spouses compromise with respect to the 
rate at which they eat up their joint wealth while they are both alive. 
The old husband would prefer to eat up the wealth more rapidly, and 
the young wife would prefer to consume at a slower rate. The formula 
for each spouse’s consumption when married takes both spouses’ 
survival probabilities into account as well as the relative spousal utility 
weights. To our knowledge empirical studies of consumption and 
savings at the household level have not considered this point—that 
the time preference rate for a household may depend on the age-sex 
composition of the household. 

Table 2 reports the gains from marriage as well as three-person 
polygamy among individuals who have identical survival prob- 
abilities and identical initial endowments and who are weighted 
equally in the contract. The marriage and three-person polygamy 
gains are calculated as the percentage increase in a single person’s 
initial wealth needed to make him as well off as he would be in the 
marriage or polygamous relationship. The table also reports the dol- 
lar gain as a fraction of the table 1 total dollar gain from complete and 
fair annuities. Since utility is concave in wealth, the dollar gain from 
these family contracts as a fraction of the dollar gain from fair an- 
nuities is smaller than the actual utility gain from these contracts as a 
fraction of the utility gain from fair annuities. Table 2 also reports this 
latter fraction.® 

The figures in table 2 indicate that marriage can offer substantial 
risk-pooling opportunities. For a 55-year-old man using male survival 
probabilities, pooling risk through marriage is equivalent to about a 
20 percent increase in his wealth had he stayed single. The gains from 
marriage increase as one becomes older since the risks incurred are 
much greater as one ages. At age 75 marriage is equivalent to in- 
creasing one’s wealth by 30 percent when risk aversion is 1.25. 
Death-risk-pooling through marriage can be quite important even at 
young ages. The table reports gains from 11.7 to 13.6 percent at age 
30 using the male probabilities. 


8 This fraction is calculated as [(1 + m)!-” — 1]/{(1 + a)!-” — 1], where m is the 


fractional wealth equivalent gain from marriage, and a is the fractional wealth equiva- 
lent gain from fair annuities. 
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Marriage can also close much of the utility gap between no an- 
nuities and complete annuities. For example, for a 55-year-old with 
risk aversion of 0.75, marriage substitutes 46.10 percent for complete 
and fair annuities. Marriage is a better substitute for fair annuities at 
younger ages because at younger ages the probability that both 
spouses will die simultaneously is quite small relative to the probability 
that one spouse will die before the other. In addition, there appears to 
be an interaction between age and the degree of risk aversion, making 
marriage a better substitute for fair annuities at young ages when risk 
aversion is low and at old ages when risk aversion is high. 

Over a wide range of ages and parameter values, three people 
appear to be capable of capturing about 60 percent of the gains from 
fair annuities. While the complexity of the calculations precluded 
considering a four-person arrangement, we can conjecture using 
table 2 how well four people would do together. In the case of a 
55-year-old male with risk aversion of 0.75, adding one marriage 
partner is equivalent to a 20 percent increase in his wealth had he 
remained single. The marginal dollar gain from adding a third per- 
son (table 2) is 8.04 percent. If the marginal dollar gain fell at a 
constant rate in this range, the fourth person would add 8.04 x 
(8.04/20.0) = 3.23 percent.® By adding 3.23 to 28.04, we can roughly 
calculate the extent to which four people can close the utility gap. The 
procedure suggests that four people can substitute by 70 percent for a 
fair annuities market. 

Diminishing returns to risk pooling appear, then, to set in at a fairly 
rapid rate. In this example two people substitute by 46 percent, three 
people by 63 percent, and four people by over 70 percent for full 
insurance. 

Table 3 considers incomplete annuity arrangements between two 
parents and one child and between one parent and two children. In 
both cases we assume equal consumption by all family members but 
permit the initial wealth of the child or children to vary. All individu- 
als are assigned the male survival probabilities; the children are age 30 
and the parents age 55. In the case of two parents with one child, if 
the child has an initial wealth of $35,000 and the parents have an 
initial wealth of $20,000, entering into an equal consumption—will- 
swapping arrangement is equivalent to a 32 percent increase in wealth 
for each parent and a 10.6 percent increase for each child. For the 
parent this arrangement captures 71.2 percent of the utility gain from 


* This is probably a lower bound estimate for the contribution of the fourth person; 
the marginal dollar gain cannot fall at a constant 40 percent rate forever, because if it 
did the total dollar gains would, in the limit, not sum up to 46.9 percent, the full annuity 
gain of table 1. Presumably the marginal dollar gain falls at a decreasing rate, and 3.23 
percent probably underestimates the fourth person’s marginal contribution. 
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TABLE 3 


GAINS FROM INCOMPLETE ANNUITY ARRANGEMENTS IN THE FAMILY 














Two PARENTS WITH Two CHILDREN WITH 
ONE CHILD ONE PARENT 
INITIAL WEALTH Dollar Gain Dollar Gain Dollar Gain Dollar Gain 
OF EAcH CHILD to Parent to Child to Parent to Child 
($) (%) (%) (%) (%) 
25,000 14.4 34.2 255 24.8 
30,000 Dor? 20.4 16.9 18.9 
35,000 32.0 10.6 31.5 14.6 
40,000 40.8 Seo 46.1 11.5 








Norte.—The calculations assume equal consumption by all family members. Initial wealth of parent or parents is 
$20,000. R = 1.01, a = .99, and y = 0.75. 


full annuities; for the child the arrangement substitutes by 45.4 per- 
cent for full annuities. The last two columns of table 3 present the case 
of two children contracting with one parent. When each child con- 
tributes $35,000, the gain to the parent is 31.5 percent, while each 
30-year-old child enjoys a 14.6 percent gain relative to consuming as a 
single person. The numerical differences in the table for the two 
different types of families reflect, on the one hand, different mone- 
tary contributions of parents relative to children and, on the other 
hand, differences in the rate at which resources are consumed when 
all family members are alive. Resources are consumed at a slower rate 
in the two-children—one-parent case than in the one-child—two-parent 
case, since each individual’s survival probabilities are given equal 
weight in determining the optimal rate of consumption. 


Is Marrying People of Similar Ages More Efficient? 


Suppose one had to decide how to pair up four people, two who are 
old and two who are young. Is it more efficient to marry the old 
people together and the young people together than it is to mix ages? 
Intuitively, marrying two 90-year-olds together and two 20-year-olds 
together leaves a large chance that both 90-year-olds will die in the 
immediate future, and resources that they have failed to consume will 
be lost to the 20-year-olds who, on average, will still be alive.!° The 
countervailing argument against mixed-age marriages is that mixing 


10 We assume here that any involuntary bequests that arise from the simultaneous 
death of both marriage partners or from the death of a surviving spouse are not 
inherited by any of these four individuals. Again the government can be thought of as 
collecting these residual bequests and distributing them each year to the newborn. We 
thank Finis Welch for a helpful discussion on this section. 
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ages involves greater risk to one of the two partners; the utility cost of 
this greater risk may exceed the utility gain from the increase in 
expected resources arising in mixed marriages." 

We investigated potential efficiency gains from mixed marriages 
between two 55-year-olds and two 30-year-olds, where each individual 
was risk averse at the 0.75 level. The 55-year-olds were assigned male 
survival probabilities, while the 30-year-olds were assigned female 
survival probabilities. When risk aversion equals 0.75, weights of 1.7 
for the old person yield utility levels for both old and young which 
exceed those in the old-old, young-young marriages of table 2. The 
additional dollar gain to the old person from this weighted marriage 
with the young person is 3.1 percent; the added gain to the young 
person is 1.6 percent. 

These additional gains from mixed-age marriages require, how- 
ever, a fairly skewed distribution of consumption within the marriage. 
For this example the young-old weighting scheme that dominates 
old-old, young-young coupling involves the older spouse’s consuming 
about 86 percent more than the younger spouse while they are both 
alive. If it is too costly to negotiate such an arrangement within the 
marriage or if the type of consumption (e.g., housing) within mar- 
riage is nonexcludable, then equal consumption marriages of indi- 
viduals with similar survival probabilities (of similar ages) will be the 
rule rather than the exception. Of course, we have been discussing 
here marriages in which each spouse has the same initial dowry. The 
old-young marriages can dominate old-old, young-young marriages 
even under an equal consumption arrangement provided the dowry 
of the young spouse sufficiently exceeds that of the old spouse. 


III. Enforcement with and without Altruism 


In the absence of altruism would family members voluntarily main- 
tain these implicit contracts as family members age? The answer is 


11 To see this consider an old-young marriage in which the young person promises to 
consume less than the old person in the state of nature in which both spouses survive. 
Suppose that this promise to the old person of higher consumption in the “both 
survive” state is large enough to exactly compensate the old person for the loss in 
expected utility from the state in which his spouse dies but he survives. The old person’s 
expected utility from this latter “bequest” state is lower when he marries someone 
young, rather than someone old, because the probability of the young person actually 
dying is smaller. While the old person is by assumption no worse off in this compen- 
sated old-young marriage, the young person could be worse off than if he had married 
someone young. By entering into the compensated old-young marriage, the young 
person reduces his payoff in the both survive state while leaving the payoff in the 
bequest state unchanged. He also increases the probability of the bequest state and 
lowers the probability of the both survive state. Although expected consumption for the 
young spouse rises, the spreading of the payoffs may lower expected utility, depending 
on the young spouse’s degree of risk aversion. 
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that there always are ways of structuring payments to individuals 
within the family so that each individual at each moment in time has a 
selfish interest in maintaining the original implicit contract. An equal 
consumption marriage contract between two individuals with the 
same survival probabilities and the same initial endowment is a good 
first example. If each spouse maintains control over his own wealth 
while both spouses are alive and consumes at the same rate as the 
other spouse, then each will separately have an incentive to continue 
the contract at each point in time. A similar type of individual control 
can be maintained in family arrangements; rather than have the 
parents use up all their resources before the children begin con- 
tributing to their support, the children can contribute each period in 
return for that period’s expected parental bequest. This scenario of 
parents’ maintaining control over their wealth until the very end, as 
enforcement leverage over their children, may partly explain the 
limited use of gifts as a tax-saving intergenerational transfer device. 

The proof of this proposition is immediate from equation (10). 
Given their initial endowments at time t — 1, WH, and W#_,, family 
members choose a value @* such that the contract to consume the 
contingent plan [C#,(0*), C#_,(6*), C#(@*), C$(0*), ... , CH(6*), CS(6*)] 
is in the core at time ¢ — 1. The consumption plan at time ¢ [C#(6*), 
C7(0*), . . . , C#(0*), C§(O*)] represents the period t Pareto-efficient 
contract corresponding to the initially chosen utility weight 6*. This 
plan is in the core for a set S, of individual endowments of family 
members in period t, WH? and W?, which satisfy WF + W? = W,. To 
insure that the initial contract remains a core allocation for each 
family member, side payments are made at time ¢t — 1 when con- 
sumption in period t — 1 occurs. The side payments leave the period t 
individual endowments in set S;. Since the initial contract [C#,(6*), 
C#_,(0*); C#(0*), C2(0*), .. . , C#(0*), CZ(0*)] is in the core, each selfish 
family member will have a personal incentive to make or accept these 
side payments. 

There are two additional questions of enforcement to consider. 
One problem is that a spouse may covertly name a third party as 
beneficiary in his will in exchange for the same commitment by the 
third party or in exchange for a particular service. A second type of 
cheating may occur when one or both spouses covertly consume in 
excess of the consumption levels dictated by an optimal implicit mar- 
riage contract; while each spouse may correctly believe that he or she 
is the beneficiary in the other spouse’s will, each may try to take 
advantage of the other by increasing his own consumption and thus 
reducing the potential bequest available to the other spouse. 

These two types of cheating will be more problematic for implicit 
incomplete annuity agreements between friends or relatives who are 


388 JOURNAL OF POLITICAL ECONOMY 


physically separated. The consumption cheating scenario can be 
modeled as a Nash equilibrium in which each partner chooses his 
consumption path by taking the other partner’s consumption path 
and potential bequest path as given. Resources are consumed at a 
faster rate in the Nash equilibrium as each partner fails to consider 
how his consumption will diminish his expected bequest and thus the 
expected utility of his partner. 

Using male survival probabilities we calculated for two 55-year-olds 
the dollar equivalent utility gain from engaging in a Nash 
consumption-cheating partnership. The gains in the Nash equilib- 
rium proved to be almost identical to those in the more efficient 
marriage contract. For levels of risk aversion OOF 5R1k2 5s andale75; 
the percentage dollar increments are, respectively, 19.9, 22.2, and 
23.5. While the rate of consumption is faster in the Nash equilibrium, 
it is not much faster than in the marriage contract. Intuitively cheat- 
ing by overconsuming is fine provided one’s partner actually dies; but 
if one’s partner survives, the early excessive consumption will require 
relative deprivation later on. Apparently this latter consideration 
dominates the former, leaving utility in the cheating equilibrium at 
essentially the same level as under a marriage contract. These exam- 
ples suggest that consumption cheating does not represent a substan- 
tial impediment to consumption-risk-sharing arrangements. 

Another means of enforcing these implicit contracts is simply al- 
truism. All of our calculations have involved maximizing a weighted 
sum of individual family members’ utilities. If, however, each family 
member is altruistic toward each other and each weights each family 
member’s utility from consumption in the same way, then all family 
members would unanimously agree on the utility maximand. The 
calculations we have presented can, therefore, be thought of as re- 
sulting from the maximization of an agreed-upon altruistic family 
utility function. Since all family members agree on the maximand, 
there is no problem of enforcement. 


IV. Summary and Conclusion 


This paper has demonstrated that consumption and bequest-sharing 
arrangements within marriage and larger families can substitute to a 
large extent for complete and fair annuity markets. In the absence of 
such public markets, individuals have strong economic incentives to 
establish relationships which provide risk-mitigating opportunities. 
Within marriages and families there is a degree of trust, information, 
and love which aids in the enforcement of risk-sharing agreements. 
Our calculations indicate that pooling the risk of death can be an 
important economic incentive for family formation; the paper also 
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suggests that the current instability in family arrangements may, to 
some extent, reflect recent growth in pension and social security 
public annuities. The methodological approach of this paper can be 
applied to the study of family insurance against other types of risks. 
Of chief interest are those types of risks that are handled very poorly 
by anonymous public markets. Disability insurance and insurance 
against earnings losses are good examples. 

Our approach has been to compare family insurance with perfect 
insurance. It would seem worthwhile to compare family insurance 
with public market insurance where the market insurance is subject to 
adverse selection and moral hazard problems and family insurance is 
not. Realistic specification of the degree of adverse selection and 
moral hazard may indicate that family insurance dominates public 
market insurance even in small families. 

Finally, the paper suggests the empirical difficulty of determining 
whether intergenerational transfers reflect altruism or simply risk- 
mitigating arrangements of essentially selfish individuals in the ab- 
sence of perfect insurance markets. Distinguishing between the selfish 
and altruistic models is fundamental to a number of major economic 
questions, including the impact of the social security system on na- 
tional saving and the effectiveness of fiscal policy.” 


Appendix 


Computational Algorithm for the Two-Family-Members Dynamic 
Risk-pooling Problem 


This Appendix indicates the algorithm used to solve the two-family-members 
dynamic programming problem, copied here as equation (Al). The al- 
gorithm for the case of three family members is similar to that for two 
members and is available from the authors. While we consider the iso-elastic 
family of utility functions, our algorithm can be applied to any homothetic 
utility function. 
V_-4(Wy_,) = max [wu (CL) + Ou (CP) + AP yp Qut-V (Wr) 
Wit ei Or=T,...51 (Al) 


+ a@P y-1(1 = Qip-D)A UW) + 0aQye (1 — Pes) 5(W,)], 
subject to 
WIR + CL, + Gy = We. (A2) 
Again, the letters H and S correspond to the two family members with 
respective conditional survival probabilities P,,-, and Q,,_,. The expression 


W, is joint family wealth, @ is the weighting factor, and H,(W,) and S,(W;,) are 
the expected utility levels for each family member if he alone survives to 


period t. 


12 See Barro 1974. 
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Optimal values for C# and C# are found recursively starting at period T and 
proceeding to period 0. We demonstrate that V,(W,) may be written in the 
form: 





1%, 
V(W,) = 1, >, (A3) 
daszy 
where 2, is a constant. We also show that total family consumption, C;, is given 
by 


. op” 
Comes Tarae RE (A4) 


where K, is another constant. Given total family consumption, consumption of 
the two members is 
Cy-1 


CE ie piggies acta 


guy 


L560 | a 


We demonstrate that K, is a function of v, and that v,, is a function of K,. 
Starting then at the initial value for K,, K74,, we can compute v7; v7 in turn 
gives K,, which in turn gives v7_,. Proceeding in this fashion to period zero we 
compute the entire sequence of v,'s and K;’s. These values can then be used in 
equation (A4) to compute the ratio of consumption to wealth at each period. 
These ratios together with an initial level of wealth plus (A2) and (A5) 
generate the optimal consumption path. The homotheticity of the utility 
function permits us to calculate recursively the shape of the consumption 
path independently of the initial level of wealth. 
Starting with period T the maximization problem for equation (A1) is 


— (CH +6—4 
ay a 
st. CH+Ci<Wp, CH,Cl>0. 








V7>(Wr) = max i (Cz)-” 


Solving this maximization and computing the indirect utility function for V7, 
we have 





VWs) = op + _ wy, where vp = (1 + 6"), (A6) 
" l = guy 
ta git ae aa 


For t < T, (Al) for the iso-elastic case is written as 





V,-4(W,4) = max —+_ (Ca)? + 9 (G3 yer 
Grice ly lez 














P, Q, 1 Ey Q 1 
+a v Wr? + a —+— ( — )p wi 
Pe Q.-1 i Y i Pe Qi ‘ hes ‘ 


a Aa 1 = = Q 5; —_ wer 
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st CH, +C8,+ = ae 
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In going from (A1) to (A8) we use the fact that A(W,) =h{W}-7/(1 — y)] and 
S(W,) = s{Wi-7/(1 — y)] for the iso-elastic utility function. The values for h, 
and Ss; are implicitly defined as the bracketed term in equation (7) in the text 
with 7 = 0 corresponding to time t and with each family member’s survival 
probabilities from time t¢ substituting for P,. 

It is easy to see from (A8) that for given total family consumption, C,, C# 
and C? will always satisfy (A5). Hence we may rewrite (A8) as 





Cha (A9) 


V-4(Wi-1) = max vp 
Ct—-1 


ay 
pene ese es ( P cae +0 ( _ Saige 
bi ye ee O74 Pi Qy-1 Py] Qe 


Denoting the term in brackets by K, we now have 
V,-, (W,_1) = max vp : 
Cig dey. 


et 6.2 1B = W,.. 








Cae = WI'K, (A10) 
me) 


Maximizing (A10) and computing the indirect utility functions yields 


wy = ler ih (ARK) PRY, (All) 


COR 


Cri = Wei ay aK R)PRo 


(A12) 
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An Economic Theory of Self-Control 





Richard H. Thaler 
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The concept of self-control is incorporated in a theory of individual 
intertemporal choice by modeling the individual as an organization. 
The individual at a point in time is assumed to be both a farsighted 
planner and a myopic doer. The resulting conflict is seen to be funda- 
mentally similar to the agency conflict between the owners and 
managers of a firm. Both individuals and firms use the same tech- 
niques to mitigate the problems which the conflicts create. This 
paper stresses the implications of this agency model and discusses as 
applications the effect of pensions on saving, saving and the timing 
of income flows, and individual discount rates. 


For many years Christmas clubs paid no interest. Members deposited 
money each week but could only withdraw the money on December 1. 
The clubs were very popular, although they seemed to be dominated 
by simply depositing money in an interest-bearing savings account. 

Passbook loans do still exist. These loans allow an individual with 
$5,000 in a savings account earning 5 percent interest to borrow at 9 
percent using the balance as collateral, instead of at 10 percent with 


The authorship of this paper was fully collaborative. The order of the names on this 
paper and its companion (Shefrin and Thaler 1980) corresponds to our division of 
labor. This paper was written while Thaler was a visiting scholar at the National Bureau 
of Economic Research at Stanford, California. While there he received financial sup- 
port from the Kaiser Family Foundation and the Robert Wood Johnson Foundation. 
Helpful comments on earlier drafts were provided by George Ainslee, James Bucha- 
nan, Tom Russell, and numerous other friends and colleagues. We thank them all. 


[Journal of Political Economy, 1981, vol. 89, no. 2]. 
© 1981 by The University of Chicago. 0022-3808/81/8902-0008$01.50 


Do 


THEORY OF SELF-CONTROL 393 


no collateral. Obviously the individual could simply withdraw the 
money from his savings account at an (opportunity) cost of only 5 
percent. 

Smoking clinics are a new and thriving business. A smoking clinic 
will help people who want to stop smoking—for a fee of several 
hundred dollars. 

What does economic theory have to say about these institutions? 
George J. Stigler provides the following analysis: 


One can of course explain the participation in a Christmas 
fund by introducing another item of preference: a desire of 
people to protect themselves against a future lack of 
willpower. . . . If we stopped the analysis with this explana- 
tion, we would turn utility into a tautology: a reason, we 
would be saying, can always be found for whatever we ob- 
serve a man to do. In order to preserve the predictive power 
of the utility theory, we must continue our Christmas fund 
analysis as follows. The foregone cost of putting money in a 
Christmas fund is the interest one could earn by putting the 
same money in a savings account. If interest rates on savings 
accounts rise, the cost of buying protection against a loss of 
willpower rises and less of it ought to be bought. . . . [Stigler 
1966, p. 57] 


We agree with the remarks above and therefore offer a model that 
says more about these institutions than the fact that the demand for 
their services will be negatively related to price. We do so by propos- 
ing a simple extension of orthodox models, using orthodox tools, that 
permits such behavior to be viewed as rational. This rationalization is 
based on an analysis of the technology of self-control using the theory 
of agency rather than reliance on ad hoc explanations in which trans- 
action costs, taxes, and income effects play a major role. Our model 
can predict, in a nontautological way, the circumstances in which 
these kinds of behavior will be observed. In particular, our new theory 
of intertemporal choice has important implications for theories of 
saving behavior. In the last section of the paper we discuss some of 
these implications and offer empirical evidence in support of our 
ideas. 


I. The Model 


The idea of self-control is paradoxical unless it is as- 
sumed that the psyche contains more than one energy 
system, and that these energy systems have some degree 
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of independence from each other. [DONALD McInTosH 
1969] 


Why individuals would impose constraints on their future behavior is 
a problem that has received attention from economists since Strotz’s 
(1955-56) classic paper.’ Strotz and those who have followed him 
(Pollak 1968; Blackorby et al. 1973; Peleg and Yaari 1973; Hammond 
1976; and Yaari 1977) have analyzed the phenomenon as one of 
changing tastes. In Strotz’s formulation a conflict occurs between to- 
day’s preferences and tomorrow's preferences if the discount func- 
tion used today is not exponential with a constant exponent. 

Our framework differs from the changing-tastes literature in that 
we model man as having two sets of preferences that are in conflict at 
a single point in time. This idea is certainly not new. Adam Smith used 
a two-self model much like ours in his Theory of Moral Sentuments 
(1759). More recently Schelling (1960, 1978) and Buchanan (1975) 
have recognized the importance of simultaneous conflict in under- 
standing self-control problems. Outside economics the idea is com- 
monplace, with the writings of Freud (1958) and Berlin (1969) de- 
serving special mention. 

Nonetheless, to the best of our knowledge our work is the first 
systematic, formal treatment of a two-self economic man. We have 
adopted a two-self model because, as McIntosh says above, the notion 
of self-control is paradoxical without it. Furthermore, we utilize an 
organizational analogy that leads to both insights into human behav- 
ior and a rich explanatory model. We will briefly describe the model 
here. We do so in order to make explicit the nature of our two-self 
conceptualization. The model also leads to specific predictions about 
behavior, which are discussed in Section III below. Further details are 
available in our more formal companion paper (Shefrin and Thaler 
1980). 

Our model is cast in discrete time. Consider an individual with a 


fixed income stream y = (yi, yo, ---» yr). Think of period T as 
retirement and let yr = 0. Let the individual choose a nonnegative 
level of consumption c, int. Callc = (¢,,¢2, ... ,¢7) a consumption plan. 


The conflict between short-run and long-run preferences is intro- 
duced by viewing the individual as an organization. At any point in 
time the organization consists of a planner and a doer.” The planner is 
concerned with lifetime utility, while the doer exists only for one 
period and is completely selfish, or myopic. The period ¢ doer is as- 
sumed to have direct control over period t consumption rate c;. The 


1 Two important contributions by noneconomists are Ainslee (1975) and Elster 
(1977). 
2 These terms originated in an early draft of Thaler’s (1980). 
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doer’s utility function is given by Z,(-); Z, is taken to be independent 
of all components of c except c;.2 Furthermore, suppose initially that 
Z; is strictly increasing and concave in ¢. 

In our model the planner does not actually consume but, rather, de- 
rives utility from the consumption of the doers. Therefore the plan- 
ner’s utility function is given by V(Z,,Z.,...,Z7). Observe that a plan 
which maximizes V subject to the present value budget constraint, >/_, 
ea nay yt = Y, is considered optimal from the planner’s point of view. 
However, without some method to control the doer’s actions, this plan 
cannot be implemented. Indeed, under the assumptions above, the 
actual consumption stream chosen would have total lifetime income 
consumed during the first period, when the period-one doer would 
borrow Y — y, on the “perfect” capital market. In order to prevent this 
from occurring, the planner requires some psychic technology capa- 
ble of affecting the doer’s behavior. Two main techniques are avail- 
able for this: (1) The doer can be given discretion in which case either his 
preferences must be modified or his incentives must be altered, or (2) the 
doer’s set of choices may instead be limited by imposing rules that 
change the constraints the doer faces. 

We begin by analyzing the case in which no rules are used. We refer 
to this case as pure discretion. While we do not believe that the pure 
discretion case is empirically important (most people appear to use at 
least some kinds of rules), it provides a useful foundation for our 
model, to which rules are easily added. Furthermore, since this case 
corresponds closely to that usually considered in economics, it high- 
lights the differences between our model and the standard frame- 
work. 

Recall that Z,(-) was assumed to be unbounded. We now specify that 
Z, depends on a preference modification parameter 6, selected by the 
planner. The choice of @, allows the planner to alter Z, such that it 
possesses an internal maximum. By appropriately selecting 6,, any 
desired c, may be obtained; however, the lower the desired c, 1s, the 
more modification will be required. Furthermore, 0Z,/06, is negative; 
that is, modification reduces short-run utility and is therefore costly. 
Finally, we assume that the marginal cost of modification increases 
with 6. Thus successive reductions in c, require increasing reductions 
1+, 

If the planner could exercise direct control over the choice of a 
consumption plan, 6, would be set equal to zero for all ¢ (because 
modification is costly) and c would be selected to maximize V subject to 
the budget constraint. Since under pure discretion this is assumed to 


° The extreme assumption that doers do not care at all about past or future doers is 
adopted just for expositional simplicity. Other arguments of Z, could easily be added. 
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be infeasible, the planner must instead choose 0 = (6;, 92, . - - , 9r) 
to maximize V. (The solution to the problem is described in Shefrin 
and Thaler [1980].) Essentially, modification is increased until the 
marginal utility derived from additional consumption in retirement 
(period T) equals the marginal loss in utility in earlier periods due 
to modification. 

In the more general case when both rules and preference modifica- 
tion are permitted, the planner may also alter the budget constraint 
facing each doer. This allows the planner to reduce ¢; without incur- 
ring modification costs. However, since available rules are imperfect 
(see next section), the planner will have to trade off modification costs 
with the opportunity costs associated with using second-best-type 
rules. Rules are formally incorporated into the planner-doer model in 
Shefrin and Thaler (1980). 


II. Techniques to Reduce Conflicts in Individuals 
and Organizations 


We have characterized self-control as an internal conflict resembling 
the principal-agent conflict between the owner and manager of a firm 
(see Ross 1973; Jensen and Meckling 1976). In this section we de- 
scribe the actual techniques used by individuals and firms to mitigate 
these conflicts. The techniques fall into the two categories highlighted 
in the model: rules and incentives. We provide many illustrations of 
the methods individuals use because these are in essence part of our 
model. When individuals use rules it is impossible to characterize their 
behavior simply with first-order conditions. The limits on the kinds of 
rules which individuals will find feasible lead to the specific predic- 
tions about saving behavior discussed in Section III. In this section we 
also compare the techniques individuals use with those used in firms. 
The close correspondence we find lends intuitive support to our 
principal-agent model. 


A. Methods to Alter Incentives 


Individuals use three basic techniques to alter the doer’s incentives. 
First, the doer’s preferences can be modified directly. Some individu- 
als consider saving a good in and of itself.4 In this case doer myopia 
does not inhibit saving. Second, inputs to a saving or dieting program 
can be explicitly monitored via weekly budgets or calorie counting 


4 This idea has been suggested by Scitovsky (1976). The importance of norms in 
controlling individual behavior is also stressed heavily by Adam Smith (1759, p. 326) 
and by Irving Fisher (1930). 
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(customers of diet clinics and credit counselors are advised to do this). 
Simply keeping track seems to act as a tax on any behavior which the 
planner views as deviant. Third, incentives can be explicitly altered: 
Alcoholics take the drug Antabuse which makes them ill if they take a 
drink; academics agree to give a paper at a conference to provide 
a proximate incentive to write it. 

' Firms use the same three methods. First, profit-sharing plans are 
quite popular even though they offer only trivial financial incentives 
to all but the highest executives. We believe firms adopt them because 
they help create an atmosphere in which the employees’ preferences 
are more similar to those of the owner. Second, firms monitor de- 
partmental inputs through cost accounting and then tie compensation 
to these input measures. Third, departmental profits are measured 
and used as performance measures for managers. 


B. Methods to Alter Opportunities: Rules 


If the costs of monitoring and persuasion are high, individuals will 
resort to rules that restrict the doer’s opportunities. In the extreme, 
all doer discretion can be eliminated using what Strotz referred to as 
the strategy of precommitment. Such behavior is rational in our 
model if the rule can approximate the choices that the planner would 
select. Market precommitment institutions are observed, as we would 
predict, in such areas as saving and dieting. For example, people pay 
to go to “fat farms” which essentially are resorts that promise not to 
feed their customers. 

Less extreme rules can limit the range of doer discretion, usually 
through the use of self-imposed rules of thumb. In the savings con- 
text several such rules appear to be commonly used. These rules alter 
the budget constraint faced by the doer in much the same way as 
credit limits imposed by lenders do.® A simple first departure from 
pure discretion is a ban on borrowing, the so-called debt ethic. A 
somewhat weaker rule which seems common is to prohibit borrowing 
except for specific purchases, like houses and automobiles. Another 
rule of thumb is a prohibition on dissaving combined with limits on 
borrowing. Using this rule of thumb, a person might borrow and lend 
simultaneously in spite of a substantial difference in the interest rates, 
as in the case of the passbook loan. The loan allows him to transfer 
consumption across time periods while it provides a regimented re- 
payment scheme. : 


° While it is difficult to document the extent to which these precise rules are used, 85 
percent of Cagan’s (1965) sample of Consumers Union members reported using one of 
these or similar rules to determine monthly saving. 
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Rules can also eliminate discretion over a specific class of decisions 
for which the conflict is particularly acute. Dieters try not to keep 
cheesecake in the refrigerator and will refuse invitations to lavish 
dinner parties; problem gamblers avoid Las Vegas. Also, many smok- 
ers pay more for their cigarettes by buying them by the pack instead 
of the carton—it helps enforce a self-imposed ration such as one pack 
a day. 

Again, the same types of rules are observed in organizations.® 
“Pure” rules are observed most often in bureaucratic organiza- 
tions because the costs of monitoring output are so high. Rules that 
limit managerial discretion over a particular range are frequently in 
the form of guidelines (e.g., a plant manager can adapt any invest- 
ment that exceeds some stated rate of return). Similarly, rules may 
prohibit discretion over a specific class of decisions; loan officers, for 
example, might need approval for loans to relatives or friends. 

We wish to make three other points about internal rules of thumb. 
First, it is useful to consider these rules as learned as much as 
chosen. Rules like the debt ethic are learned from parents and other 
models, which suggests that there will be differences in the use of 
rules depending on social class, education, and age. Second, rules of 
thumb are likely to become habits. By establishing a routine, the doer 
decision process can be avoided. Third, to the extent that the rules do 
become habits, there will be rigidities built into the individual’s be- 
havior.’ The implications of these observations are discussed in the 
next section. 


IlI. Implications 


We now turn to a discussion of the implications of the planner-doer 
model. What predictions about behavior can be made with our model 
that are inconsistent with the standard model? Some of these predic- 
tions are obvious. We predict that people will rationally choose to 
impose constraints on their own behavior. Furthermore, we predict 
that such precommitments will occur primarily for those goods whose 
benefits and costs occur at different dates. We present here some less 
obvious predictions based on our model. 

Because our framework is richer than the standard theory, vari- 


6 For a discussion of firms imposing constraints on their future financial policies, see 
Myers (1977). 

7 Habits can be formally introduced in two stages. At first stage Z, can also be made a 
function of 6,, 5 < t, to reflect the fact that self-control at the early dates renders 
self-control at later dates less costly. At the second stage, Z,; can also be parameterized 
on the adopted rule. In this case the function Z, could exhibit an internal maximum in ¢, 
when 6, = 0. It is interesting to note that modification would now be required in order 
to break the rule. This would tend to explain miserliness, for instance. 


THEORY OF SELF-CONTROL 399 


ables that the standard theory treats as irrelevant differences in form 
we model as differences in substance. In Sections II[A and IIIB we 
consider how differences in the form of payment (holding the level 
constant) affect saving decisions, and in Section IIC we present 
variables other than borrowing and lending rates that determine 
individual marginal rates of time preference. 


A. Pensions and Saving 


Consider two identical individuals with the same total income and 
wealth. Assume that they both save some fraction s of their income. 
Now give one a mandatory pension plan that forces him to save p <s. 
What will happen to total saving? Though it is difficult to get a specific 
prediction from the standard model (see Feldstein 1977), a first-order 
prediction would be that total saving is unaffected. Other forms of 
saving should fall by the approximate amount of the pension. 

Our model has a different prediction: The pension plan produces 
Saving at no psychic cost. Modification costs occur only when Saving is 
voluntarily withheld. This is what we call “discretionary saving.” Thus 
since the marginal cost of saving is lower at the old saving level, we 
expect total saving to go up. (In other words, the offset in other saving 
will be less than the size of the pension.) 

Furthermore, in two cases the offset will be essentially zero: (1) The 
individual uses a saving rule such as “save s percent of disposable 
income” which is not changed when the pension is introduced (total 
saving increases by [1 — s]py,), and (2) the individual uses discretion 
but treats saving as a good for its own sake rather than as a transfer to 
future consumption. In this case discretionary saving and retirement 
saving are not perfect substitutes as in the standard model; in fact, 
their cross-elasticities of demand could be zero. 

The effect of pensions on saving has been investigated with indi- 
vidual data by Cagan (1965), Katona (1965), and Munnell (1974, 
1976). All obtained similar results. Cagan used a sample of Consum- 
ers Union members. He found that for those members with a man- 
datory pension plan, other saving actually was higher than for those 
without pensions (see Cagan 1965, p. 21). Munnell (1974) replicated 
Cagan’s study using the same data source. She used a different mea- 
sure of saving, replaced before-tax income with after-tax income, and 
restricted her analysis to a subset of observations she thought to be 
more reliable. She then regressed the nonpension saving to income 
ratio on several variables including a pension dummy. Her basic result 


8 A formal treatment of the effect of pensions on saving appears in Shefrin and 
Thaler (1980). 
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was that the pension had no effect on other saving (i.e., a zero offset). 
The coefficient of the pension dummy was never significant. Its high- 
est ¢ value occurred for the 55-65 age group (¢ = 1.2), for which those 
with pensions saved 3 percent less than those without pensions. 
Though other explanations (such as selectivity bias) have been offered 
for these results, we find that they lend support to our model. 


B. Saving and the Timing of Income Flows 


The importance of current disposable income (as opposed to perma- 
nent income) yields another prediction from our model that differs 
from the standard theory. Consider two identical individuals, S and 
B; S receives a salary of $12,000 per year paid in 12 monthly install- 
ments of $1,000, while B receives a salary of $10,000 per year paid in 
monthly installments plus a guaranteed bonus of $2,000 paid in 
March each year. Standard theories of saving behavior would predict 
that these two individuals would save the same amount. Our model 
predicts that, on average, B will save more. 

Although we know of no test of this hypothesis,’ there is one bit of 
circumstantial evidence. In Japan, where there is a very high saving 
rate, bonus schemes are quite common. We think this is no coinci- 
dence. A test would be possible, given the right data. We predict that 
individuals who are paid a portion of their salary via a lump-sum 
bonus will have higher saving rates than those who receive their 
compensation in a smooth pattern. How does this follow from the 
model? 

We have characterized saving behavior primarily as a set of self- 
imposed rules of thumb and externally enforced saving plans. For an 
individual like B, those rules and plans will be based on his regular 
monthly income. Contributions to pension plans, payments on whole 
life insurance policies, mortgage payments, and so forth must be made 
on a regular basis. Furthermore, most individuals prefer to have their 
monthly inflows and outflows roughly balance.’° For B to act like S, he 
could deposit the bonus in the bank and draw it down gradually 
during the year (as if his salary were $12,000), or he could borrow the 
$2,000 over the course of the year and repay the loan when the bonus 
is paid. However, we feel that neither of these behavior patterns is 
likely to be widely observed. Notice that they violate either the ban on 
borrowing or the ban on dissaving. To the extent that these rules of 
thumb are used, imitating the behavior of S will be difficult. We 


® However, a related issue is investigated by Landsberger (1966). 

© This explains why many teachers sacrifice interest by electing to receive their 
academic-year salary paid in 12 monthly installments (September—August) rather than 
10 (September—June). 
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believe that for the typical individual much of the bonus will end up 
being saved, especially through the purchase of durable goods. This 
amounts to the use of an auxiliary rule regarding the disposition of 
bonuses received. We expect that total saving for B will exceed that 
for S because of the technology of self-control. By paying the bonus 
the firm is acting as an external self-control device (much like parents 
who tell children that money gifts at Christmas must go into a savings 
account). Temptations to spend during the year will be overcome 
because the smaller monthly salary will make them seem beyond the 
individual’s means. This technology seems to have been recognized by 
the millions of taxpayers each year who claim too few exemptions in 
order to assure a tax refund.!! Obviously some self-deception (doer 
deception) is necessary for this device to work, but doers can appar- 
ently be deceived quite easily. How else can one explain the not 
uncommon practice of knowingly setting one’s watch a few minutes 
ahead (“in order to get to places on time”)? 

As the analysis in this section suggests, the shape of the income 
stream will affect the type of saving strategy adopted. Those individu- 
als with variable and uncertain incomes will find a discretionary rule 
such as saving s percent each month difficult to enforce in the low- 
income months. Without a mandatory saving plan, they would have to 
adopt some more complex strategy to save effectively. Similarly, those 
individuals whose incomes are expected to decline (such as profes- 
sional athletes) would prefer to save a large proportion of their high 
current incomes and a smaller (perhaps negative) proportion of their 
lower future incomes. For both, a mandatory pension plan is particu- 
larly likely to increase total saving. A more sophisticated analysis of 
the effect of pensions on saving might detect differences of this sort. 

The case of athletes points up the extreme differences in behavior 
that self-control can produce. Their declining income stream creates a 
difficult self-control problem in the high-income years. Some athletes 
hire agents to invest their incomes and limit their current spending, 
and many of them become rich. Others rely on discretionary strate- 
gies and end up bankrupt. Both types of behavior are possible in our 
model, depending on the degree of planner control and the types 
of precommitment strategies available. The best predictors of which 
individuals will fall into which groups are probably related to family 
background, since the family is the most likely place for the individual 
to learn (or not learn) the rules and norms necessary to overcome the 
self-control problems.” 3 


1 ¥.g.,1n 1969, 55 million taxpayers received refunds while 18 million owed taxes. In 
dollars, overpayments exceeded underpayments by 39 percent. 

2 The last two paragraphs were prompted by a suggestion from Sam Peltzman. 
Irving Fisher also discussed some of these issues. He notes that some individuals spend 
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C. Individual Marginal Rates of Time Preference 


The orthodox theory of intertemporal choice as formulated by Irving 
Fisher produces a very strong result. Each person should equate his 
marginal rate of time preference (MRTP) with the relevant after-tax 
interest rate.!? Thus the theory predicts that all individuals who face 
the same after-tax interest rate will make the same marginal inter- 
temporal choices: Specifically, they will act as if they used the after-tax 
interest rate as their discount rate. This follows because individuals 
are assumed to use capital markets to arbitrage away any difference 
between what Fisher called their “rate of impatience” and the interest 
rate. It should be noted, however, that this result can fail to hold if 
capital markets impose quantity constraints on borrowers (capital 
rationing). In this case borrowers may be forced to stop borrowing 
even though their rate of impatience exceeds the interest rate. 

In exactly the same fashion, self-imposed borrowing constraints 
such as those discussed in Section IIB prevent the complete internal 
arbitrage from taking place. Thus in our model, in which such con- 
straints play an important role, the presumption that individual 
MRTPs will equal the interest rate no longer holds. Indeed, we expect 
to observe behavior that implies an MRTP greater than the interest 
rate and at the same time an unwillingness to engage in additional 
borrowing. Two points need to be raised about this implication. First, 
once rules are incorporated into the analysis, the failure to equate the 
MRTP to the interest rate may not violate an optimality condition. If 
rules are used it is because they lead to higher levels of utility than 
pure discretion would. The inequality created is costly, but the cost 
arises from the necessity of using a second-best technology. Because 
rules by nature must be simple, rules that select the precisely correct 
consumption bundle in every situation are infeasible. Second, notice 
that if a quantity constraint is binding, whether internally or exter- 
nally imposed, observed MRTPs will be equal to or greater than the 
interest rate. 

Attempts to measure individual MRTPs appear in studies by Kurz, 
Spiegelman, and West (1973) and Hausman (1979). Hausman studied 
families’ purchases of room air conditioners. The trade-off between 
initial outlay and operating costs permitted him to estimate implicit 
MRTPs. The mean MRTP in his sample was about 25 percent, clearly 
above any relevant interest rate. Kurz et al. obtained similar results 


their weekly paycheck at the “grog house.” (Others, we believe, avoid the grog house 
precisely on those days.) He attributes much of the observed differences in behavior to 
social class. Our differences with Fisher are discussed in the next section. 

'8 We define the MRTP to be the marginal rate of substitution between tomorrow’s 
consumption and today’s consumption minus one. 
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by asking hypothetical questions of participants in the Seattle and 
Denver Income Maintenance Experiments. They asked a sample of 
participants a series of questions of the following sort: What size 
bonus would you demand today rather than collect a bonus of $100 in 
I year? Several different forms of this type of question were asked, 
and the results were striking. For whites the mean rate of time 
preference implied by their answers varied between 36 and 76 per- 
cent. For blacks the rates varied between 40 and 122 percent. Of 
particular interest for present purposes is the fact that this sample 
included only those respondents who said they could borrow either 
$500 to make an installment purchase or $1,000 in cash. Further- 
more, 81.3 percent of this subsample reported that they would not 
borrow $1,000 at current interest rates. (The mean perceived rate was 
generally less than 20 percent.) This strongly suggests the use of 
self-imposed borrowing constraints. 

Once individual differences in MRTPs are anticipated, it becomes 
interesting to ask what factors determine those differences. A detailed 
examination is beyond the scope of this paper, but the model predicts 
that those factors determining individual rates of impatience will (for 
the reasons stated above) also affect observed MRTPs. These factors 
are discussed at length by Fisher (who draws on the writings of John 
Rae and Eugen Bohm-Bawerk). Fisher believed that age, income, and 
marital status affect the rate of impatience (in obvious directions). He 
felt that the shape of the income stream as well as the level was 
important; if income were expected to rise, the rate of impatience 
would be higher. Six “personal characteristics” were also deemed to 
be important: foresight, self-control, habit, expectation of life, con- 
cern for the lives of other persons, and fashion." 

In principle, the planner-doer model can be tested against the 
standard model quite simply. Observe some intertemporal choice that 
implies a discount rate (such as Hausman’s study of air conditioners). 
Then regress the implied discount rate on the factors above and the 
individual’s borrowing rate. The standard framework implies that 
only the borrowing rate will be a significant predictor. Our model 
implies that the other factors will also be important. Hausman did test 
one such variable. The implied rate of discount was computed for six 
income classes. His results appear in table 1. Clearly the variation in 
discount rates cannot be attributed solely to variations in borrowing 
rates. 

We would also expect age and social class to be important in pre- 
dicting individual intertemporal choices. The young behave impa- 
tiently in part because they have yet to master the techniques of 


14 Fisher 1930, p. 81. On this topic also see Maital and Maital (1977). 


404 JOURNAL OF POLITICAL ECONOMY 


TABLE 1 


ESTIMATED DiscouNT RATES USING MEAN POPULATION ESTIMATES 











Income Class Implied Discount 
($/yr) Observations (N) Rate (%) 
600 6 89 

10,000 15 39 

15,000 16 i 

25,000 17 v7, 

35,000 8 8.9 

50,000 3 5.1 





Source.—Hausman 1979. 


self-control. To the extent that these techniques are learned from 
parents, class differences will be observed. Most important, our model 
stresses the theoretical admissability of these variables. Only further 
empirical work can establish their relative explanatory power. 


IV. Summary and Conclusion 


We now briefly recapitulate our argument. We have investigated 
intertemporal choice as a problem in the economic theory of self- 
control. As the quotation at the beginning of Section I states, the 
concept of self-control is paradoxical unless some kind of multiself 
model of man is adopted. We have introduced self-control into a 
formal model of intertemporal choice by modeling man as an organi- 
zation with a planner and many doers. Conflict occurs because the 
doers are myopic (i.e., selfish). This conflict is fundamentally similar 
to the agency relationship between the employer and the employee, 
and individuals use many of the same strategies that organizations 
adopt to deal with their “conflicts of interest.” These strategies can 
involve doer/employee discretion while their incentives have some- 
how been altered, or they may entail the implementation of precom- 
mitment (a rule) to avoid the doer/employee decision process al- 
together. 

The close correspondence between the solutions to control prob- 
lems adopted by organizations and individuals provides strong sup- 
port for our model. Although our model is nontraditional, our tools 
are strictly traditional. Formally, our model closely resembles that used 
by Ross (1973) in his study of the theory of agency. Finally, we note 
that ours is a theory of rational behavior, just as Ross’s theory is of 
profit-maximizing behavior. 

Many applications of the model are possible, and we have discussed 
a few briefly. The most important applications are in the study of 
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individual saving behavior. Our hypotheses are quite different in 
spirit from the permanent income and life-cycle hypotheses that cur- 
rently dominate the literature. On the basis of the evidence presented 
here, we feel these theories of saving should be reevaluated. 
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A Note on Loss of Control and the Optimum 
Size of the Firm 


Antonio Camacho and William D. White 
Unwersity of Illinois at Chicago Circle 


This note has been inspired by a recent paper in this journal by Calvo 
and Wellisz (1978). Its main purpose is to suggest that, in constructing 
models to study the size of firms, more attention needs to be given to 
the conditions which bring about their existence. 

In Section II of their article, Calvo and Wellisz prove a proposition 
that given certain assumptions about supervision, if a firm of any size 
exists, then loss of control will not impose finite limits on its size. 
Although their result is correct, they, as well as some other research- 
ers working in this field, fail to give explicit reasons for firms to 
exist. However, as we will argue below, there are not convincing 
reasons, under their assumptions, for firms of any size to exist, thus 
making their case for the appearance of unbounded firms rather 
weak. 

This result prompts us to suggest, as stated above, that more atten- 
tion must be given in this area of research to the very conditions which 
bring about the existence of firms. For instance, models which, in the 
spirit of Williamson’s (1967) paper, explicitly consider problems with 
information and communication, or models where it is shown that the 
activities of workers or agents may be coordinated more efficiently 
through extra market arrangements, may both produce realistic and 
interesting results regarding the size of firms. 

We proceed as follows. First, we prove that, under Calvo and 
Wellisz’s assumptions, firms will never exist if workers can be self- 
employed. Second, we show by means of a “setup” cost example that, 


We are grateful to Joseph Persky for his comments on an earlier version of this note. 
Any errors which remain are, of course, our own. 
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even if conditions which limit the opportunities of workers for self- 
employment exist, Calvo and Wellisz’s assumptions regarding skills, 
utility functions, and productivity of workers seem to imply that there 
are more efficient arrangements to carry out production activities 
than the formation of firms. The note concludes with an intriguing 
conjecture and a suggestion. 


I. No Limitations on Opportunities for Self-Employment 


We will proceed now to prove that, if there are no limitations on 
workers’ opportunities for self-employment, then under Calvo and 
Wellisz’s assumptions firms will never exist. We will obtain this result 
by showing that, under their assumptions, a worker can always 
achieve at least as high a level of utility by working for himself as he 
can by working as an employee for any firm which makes a positive 
profit out of hiring him. 

In their model, Calvo and Wellisz assume that the (Von 
Neumann-Morgenstern) utility index, U, of every worker is the same 
and depends on consumption, c, and effort, e; more specifically they 
assume: 


U =u(c) -—v(e),c 20,0 se <1, (1) 


where u' = 0,v' 2 0,u"” <0, andv” S 0. (The last two conditions rule 
out risk-loving behavior.) Effort is assumed to be at a maximum when 
e equals one, while the worker is assumed to be completely idle when e 
is equal to zero. 

Workers employed in firms are subject to monitoring where P is the 
probability that a worker will be detected shirking. Workers who are 
not detected shirking are paid a wage of w. Workers who are found 
shirking are paid we. 

For any P and w, a worker will select that level of effort which 
maximizes his expected utility z: 


z = P(u(we) — v(e)] + (1 — P)[u(w) — v(e)). (2) 


Let @ be the value of e which maximizes z. 

Calvo and Wellisz assume also that the productivity of every worker 
is the same and equal to Be (0 < e < 1) when working at the level of 
effort e. Given this assumption, a firm’s expected revenue from em- 
ploying a worker at a wage w and with a level of supervision P will be 
fe. The expected cost to the firm of employing the worker will be the 
expected payments to the worker, P(wé) + (1 — P)w, plus the ex- 
pected cost of providing a level of supervision P. Thus, the total 
expected cost of employing a worker will be greater than or equal to 
P(wé) + (1 — P)w. A necessary condition for the firm to make a 
positive expected profit is then 
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Be > P (we) + (1 — Pyw 


or 
Be 
PSPS 5 7) Sate o 


Let E(U) be the maximum expected utility which a worker can 
obtain by working for a firm at a wage rate w and under a level of 
supervision P: 


E(U) = P(u(we) — v@)] + (1 — P)[u(w) — v@)). (4) 


The expected utility of a worker when self-employed and applying 
himself to his work at the same level of effort 2 as in (4) is 


E(U) = u(Be) — v€). (5) 


Taking into consideration condition (1) and inequality (3), we can 
easily obtain 


E(U) <u(Bé) — v@), 


which in view of (5) proves our assertion. 


II. Limitations on Opportunities for Self-Employment 


We will now discuss conditions that may limit workers’ opportunities 
to be self-employed. To facilitate this discussion we will first enumer- 
ate the main assumptions of Calvo and Wellisz’s model: Everyone has 
the same skills; everyone has the same utility function and, therefore, 
the same attitude toward risk; there is no specialization of labor, no 
need for communication of instructions or for coordination of ac- 
tivities; there are constant returns to scale when everyone applies the 
same level of effort to production activities. 

Given these assumptions and the additional assumption that there 
are no institutional restrictions on self-employment, it appears that 
the only condition which might limit opportunities for self-em- 
ployment for part of the population is the existence of some factor 
of production, say land, whose ownership is unevenly distributed. 
This situation might allow owners to pay workers (those who lack any 
endowment of this factor) a wage below their marginal product. But 
let us analyze this situation in more detail. Consider two cases: (i) only 
one individual owns all the land; (11) more than one individual owns 
land. 

In case i, an individual who is a monopolist may be able to start a 
firm and make total profits greater than B by hiring workers at a wage 
below their marginal product. Then, as Calvo and Wellisz rightly 
prove, the owner can increase his profits by adding layers to the 
hierarchical structure of his firm until he employs everybody. 
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But this is a rather cumbersome arrangement. Suppose that the 
total amount of land is L, the total number of workers is k, and the 
maximum profit the monopolist can extract by employing k workers is 
a(k). By renting each worker a plot of land L/k at a price of w(k)/k, the 
monopolist can obtain the same amount of money as he would by 
establishing a firm, while workers will achieve a level of utility greater 
than or at least equal to the level they would obtain by working for the 
monopolist. Under these circumstances, firms are unlikely to be es- 
tablished. 

Similarly, firms are not likely to be established in case ii either. Any 
owner who might start a firm always has the simpler option available, 
like our monopolist in case i, of renting his land to the workers of his 
hypothetical firm and charging them a rent equal to the maximum 
possible profit he could obtain from establishing a firm divided by the 
number of his employees. Under this arrangement, the owner will 
receive the same amount of money as he would receive if he set up a 
firm, while workers will obtain a level of utility which is greater than 
or at least equal to the level of utility they would obtain by working for 
the owner as employees. 

The results obtained by Calvo and Wellisz in section II of their 1978 
paper and our own discussion above lead us to make the following 
conjecture, which we present as a still unproved theorem: that for any 
set of conditions under which there is no limit to the size of firms, 
there is no reason for firms to exist. And this conjecture, together with 
the fact that firms do exist and are of limited size, in turn suggests to 
us that the optimal size of firms may be determined by those very 
conditions that bring about their existence. 
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Prior Information and the Observational 
Equivalence Problem 


Warren E. Weber 


Virginia Polytechnic Institute and State University 


The observational equivalence problem pointed out by Sargent (1976) 
consists of two parts. The first is whether regression analysis of so- 
called classical and Keynesian reduced-form equations can discrimi- 
nate between the competing hypotheses, using data from a single 
policy regime in the absence of any prior information concerning the 
policy rule being followed. The answer to this question is no and can 
be shown as follows. 

Consider the reduced-form equation for a “goal” variable (e.g., real 
GNP). The classical version of this reduced-form equation would be 


a(L)y- + B(L)(m, — Ex=1mz) = Be (1) 


where m, is a policy variable (e.g., the nominal money supply), and 
fy, is a white-noise error term. This equation is classical, since only 
innovations in the policy will affect the goal variable. As a conse- 
quence, the time path of the goal variable will be independent of the 
policy rule chosen, so long as the rule implies m, = E,_,m; + &, where 
€, is a white-noise error distributed independently of x. 

The Keynesian version of this reduced-form equation would be 


a(L)y, + b(L)m, = %. (2) 


Assuming that (2) is invariant with respect to interventions that 
change the m, process, it is broadly “Keynesian,” since it would be 
possible to solve (2) for a policy feedback rule which would minimize 
the deviations of the goal variable around some desired level. 


Professor of economics, Virginia Polytechnic Institute and State University. I am 
indebted to Anatoli Kuprianov, Bennett McCallum, and an anonymous referee for 
useful comments on this paper. 
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To show that estimation of (1) and (2) will not distinguish between 
the Keynesian and classical positions, suppose that the policy variable 
follows the rule 


y(L)m, = & (3) 


over the entire observation period. Then substituting (3) into (1), we 
obtain 


a(L)y, + B(L)y(L)m; = be (4) 


In the absence of any prior information on the order of the 
polynomials y(L) or B(L), it would be impossible to distinguish em- 
pirically between (4) and (2). That is, if both equations included the 
same number of lagged terms, they would have the same sum of 
squared residuals, since B(L)y(L) would be an arbitrary polynomial in 
L. Further, since (4) has an identical error to (1), the sum of squared 
residuals obtained from estimating (1) would be the same as that from 
estimating (2), regardless of whether the Keynesian or classical hy- 
pothesis were true. Thus, in the absence of any prior information 
on the lengths of the distributed lags in the model, it is impossible to 
empirically distinguish between the Keynesian and classical positions 
using data from a single policy regime. 

The second question is whether such a test can be performed when 
prior information on the order of y(L) or B(L) is available. McCallum 
(1979) has pointed out one such case. Our purpose is to generalize 
McCallum’s result and to relate it to Sargent’s original paper and the 
empirical tests of the classical and Keynesian hypotheses performed 
by Barro (1977, 1978). 

Let c(L) be the polynomial in the lag operator L of the regression 
coefficients on the m, terms when equation (4) is estimated; that is, c (L) 
= B(L)y(L). Thus, (4) can be empirically distinguished from (2), if 
and only if there exist testable restrictions on the relationship between 
y(L) and c(L). To examine the nature of these restrictions, first note 
that a typical element of the convolution c(L) = B(L)y(L) is 


Cj = » BrYi-k- 


k=—0 


Thus, c(L) = B(L)y(L) can be written in matrix form as 


C =H (y)B, (5) 
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where H(y) is the (n + m + 1) x (n + 1) matrix 


0 0 O 
0 Oms.0 
Yo ie) 
Aly) = : (6) 
Va TAD 
mee 
0 eo. OF Ry, 





B = (Bo, Bi, ---, Bn)’ is the vector of coefficients of B(L), and C = (, ¢,, 

- > nim)’ 18 the vector of coefficients of c(L). Then by the result of 
Rothenberg (1973, p. 37), knowledge of m or n will yield restrictions 
relating the coefficients of y(L) and c(L) if rank [1 (y)] < (n + m + 1). 

With this result, we can now interpret Sargent’s original argument 
and McCallum’s result. Sargent considers the case in which n = m = ©, 
In this case, the restriction that the rank of H (y) be less thann +m + 1 
is not satisfied, and no restrictions between y(L) and c(L) are ob- 
tained. McCallum considers the case in which n = 0, m = ©. In this 
case rank [H (y)] = 1 < ~, and the prior information given by knowl- 
edge of y(L) implies restrictions on c(L). 

Rothenberg’s result, however, allows a more general statement of 
when knowledge of n or m implies that restrictions between y(L) and 
c(L) exist. This general result is that knowledge of n or m will imply 
that testable restrictions between y(L) and c(L) exist, unlessn = © orn 
is finite and m = 0. This generalization follows directly from the rank 
condition stated above. It implies that unless y, depends on all past 
innovations in policy or policy is white noise, the classical reduced- 
form equation (4) can be tested against the Keynesian reduced-form 
equation (2). The test is whether imposing the restrictions of equation 
(5) leads to a statistically significant decrease in the explanatory power 
of the regression equation. 

The test of the classical model by Barro used the policy equation 


y(L)m,y + AL ze = & (3’) 


in place of (3), where z, is an exogenous variable (such as the min- 
imum wage or level of unemployment compensation) and )(L) is 
a polynomial of order r in L with dy) = 0. Substituting (3’) into (1) 
would yield the new classical reduced form 
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a(L )y, + B(L)y(L)m, + BIL)A(L ze = Me (4') 
to be compared with the appropriately modified Keynesian version 
a(L)y, + b(L)m, + d(L)z_ = )- (22) 


Now let f(L) be the polynomial in L of the regression coefficients on 
the z, term when (4’) is estimated, so that f(L) = B(L)A(L). Then using 
the same arguments as above, (4’) can be empirically distinguished 
from (2') if there exist testable restrictions between y(L) and c(L) or 
between A(L) and f(L). The case of c(L) was considered before. 
Following the reasoning used for that case, we find that knowledge of 
r will yield testable restrictions between A(L) and f(L) if and only if 
rank [27 (\)| <et. er, + 1). Since 7 (X) ts* Gigs x Geet 1): 
knowledge of 7 will not imply that testable restrictions between )\(L) 
and f(L) exist when n = ~. Thus, Barro’s method of including exoge- 
nous variables in the policy rule which do not appear in (1) will not 
yield testable restrictions to distinguish classical and Keynesian re- 
duced forms when y, depends on all past innovations in the policy. 
However, when n is finite and m = QO, so that no testable restrictions 
between y(L) and c(L) exist, the knowledge that r 2 1 implies that 
testable restrictions between \(L) and f(L) exist. Thus, in this case, 
Barro’s method will permit the classical and Keynesian reduced-form 
equations to be distinguished when they could not be if z, were 
omitted from the policy rule. 

The practical usefulness of these results, of course, depends on the 
existence of reliable theoretical restrictions on the order of the lag 
distributions of B(L), y(L), or A(L). The order of B(L) depends on 
the extent to which past errors in the expectation of policy affect the 
current level of the goal variable. One source of restrictions on the 
value of n can be found in the models of aggregate supply developed 
by Lucas (1972, 1973) and Leiderman (1979) in which producers in 
informationally separated markets face the problem of distinguishing 
relative price changes from absolute price changes, given current 
information only on the prices they receive. In these models, output is 
affected only by the current error in the expectations of the price level 
implying n = 0. The same restriction on the value of n arises from 
Sargent’s (1978; 1979, chap. 16) models of employment. McCallum 
also argues in favor of such a restriction on the grounds that “it is 
hard to imagine ways in which past expectational errors could have 
direct effects on current behavior—bygones are, after all, bygones” 
(1979, p. 398). Thus, there appears to be a solid theoretical basis for 
imposing the restriction that n = 0 on B(L).1 


* This conclusion is not altered by the consideration of adjustment or search costs. 
Such costs may lead to past expectational errors having indirect effects on the current 
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Given n = 0, the rank conditions above indicate that testable re- 
strictions between c(L) and y(L) or between f(L) and A(L) will exist 
unless policy is white noise (i.e., unless m = r = 0). As yet, there is no 
satisfactory theory of how policy functions are determined, which 
would allow us to theoretically determine that policy is not white 
noise. However, the evidence from empirical attempts to estimate 
monetary policy functions does support such a conclusion.2 

Thus, we would argue that reliable theoretical and empirical evi- 
dence exists to allow testing of models with classical implications 
against models with Keynesian implications, using data from a single 
policy regime. However, the lack of theoretical restrictions on m or r 
does caution that such tests should check for their robustness to 
changes in the length of the lags assumed in the policy functions 
underlying the test. 
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value of the goal variable, because they would affect its value in previous periods. 
However, such effects would be captured in a(L) (see McCallum 1979, p. 398). 

* See Froyen (1974), Neftci and Sargent (1978), and the references contained therein 
for estimates of monetary policy functions which support this conclusion. 
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The Enclosure of Ocean Resources: Economics and the Law of the Sea. By Ross D. 


ECKERT. 
Stanford, Calif.: Hoover Institution Press, 1979. Pp. 390. $16.95. 


This is a clearly written book about the Law of the Sea, applying Coase/ 
Demsetz economic ideas to regulation of, and property in, six or more uses of 
ocean resources. The central theme is that national “enclosure” by the exten- 
sion of base lines, and widening of territorial seas and various types of 
exclusive economic zones, are analogous to the historic “enclosure” of com- 
mon lands and the appropriation of frequencies and so on. The book closes 
with chapters on UNCLOS (the United Nations Conference on the Law of the 
Sea) as a venue in which enclosure is debated and arranged. 

Written for a more general audience than academic economists, the book is 
sometimes less than rigorous in its analysis of rents, profits, and transactions 
costs and may also be given to sliding over some difficult details of law, 
geography, engineering, and the other fields in which expertise is needed to 
write such a book satisfactorily to all. But the documented attempt to put the 
claims of an economics/law property rights approach is worth pondering. 

The author fully realizes that “watertight” exclusivity arrangements for 
some ocean uses would be astronomically costly or even technically impossi- 
ble. Consider fisheries, to which he devotes a long chapter. Enclosure of 
coastal fisheries leads to internalized control over local stocks but is obviously 
less reliable in controlling the harvesting and growth of migratory species. For 
the latter, the author faintly recommends (as he does for some types of 
oil-spill problems) a mixed international-coastal regime. But he warns that, 
for such arrangements, the transactions costs between nations are high; 
he conjectures that costs for cooperation, enforcement, and decision making 
increase seriously with the number of countries participating; and he comes 
out in favor of single-coastal-nation fisheries’ regimes. 

In this recommendation he is in distributive deep water, as he recognizes. 
The alternative to single-nation control is treaty-organized “communal” con- 
trol, giving some power to all nations. Eckert argues that this system is inferior 
to that by management by individual states. The evidence of inferiority is only 
one of cost and effectiveness: that transactions costs do seem high, and success 
does seem incomplete, in such bodies. On such cost evidence, the author 
seems inclined simply to argue that single-nation regimes should prevail. But 
this alternative is not one that is available. Not only are many ocean resource 
uses unenclosable physically, but many claims to conflicting fishing or other 
rights are of ancient vintage and cannot be cancelled merely by an economist 
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ee that fishing success is higher and costs lower when one country takes 
all. 

There zs an answer, involving multinational equity ownership of the rights 
to use certain ocean resources, actual management being entrusted by all to 
one nation or to a private firm. The Pacific fur seal treaty provides an 
example. Some UNCLOS proposals having to do with minerals and other 
resources could be interpreted or remolded in this form, but Eckert does not 
do so. His adamant opposition to large numbers is therefore unconvincing. (A 
side benefit of this discussion is his analysis of the process of national adjust- 
ment to the likely future outcome of UNCLOS bargaining. A country about 
to make a bargain that depends upon its initial position tries to magnify its 
initial position. Eckert notes this several times.) 

A second answer to the many-nation difficulty is to offer compensation to 
those who are excluded. If it is too costly to let all nations fish or mine, then 
Eckert should provide for some way to buy them out. Although the process of 
doing this would be costly, it might work, and the UNCLOS pursuit of the 
dogma of the oceans being “the common heritage of mankind” could be 
utilized here. But Eckert seems to me to give little if any weight to the view 
that UNCLOS is a constitution-making body from which all must depart 
better off than they arrived. 

In other words, his quickness and clarity in making points against UNCLOS 
procedures do, it seems to me, blind him to the difficulty that distributional 
imperatives do call for multination ownership or benefit. Otherwise the injus- 
tice of exclusion would match that of the original enclosure movements, 
which Eckert seems to ignore. 

In economists’ jargon, Eckert seems strongest on efficiency and weakest in 
the realms where distribution and public choice meet. This is a pity, for his 
book ends in frustrated anger at the “hypocrisy” of bargaining at UNCLOS, 
when his earlier treatment, clean and informative, somewhat extended be- 
yond the domain of operating transactions costs, would suggest the limits of 
what is possible given the costs of constitution making and the conflict in every 
nation’s breast of distributional and efficiency objectives. 


ANTHONY SCOTT 
Unwwersity of British Columbia 


Issues in Financial Regulation. Edited by FRANKLIN R. Epwarps. Regulation of 


American Business and Industry Series. 
New York: McGraw-Hill Book Co., 1979. Pp. 526. $42.50. 


This book can best be thought of as an intellectual sampling of approaches to 
regulatory problems by a diverse group of economists and lawyers. For those 
who want such a sampling, reading it would be useful. All economists do not 
think one way about regulation, and all lawyers another. There is a diversity 
of opinion in both camps. As an economist, I will venture the opinion that the 
economics profession can offer two criteria that should be applied to analyses 
of regulatory problems: (1) Are sound theoretical notions applied to the 
problems? and (2) Is good empirical methodology used? The first criterion 
involves not merely an understanding of theory but an understanding of the 
problem, how it arises, and the market forces that surround it. The second 
criterion involves not merely the gathering and analyzing of information but 
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asking the right questions about the type, amount, and quality of information 
needed to analyze the problem. It is useful to look at the papers 1n the book in 
this manner. Lawyers will undoubtedly have other criteria, but in my opinion 
a great number of our regulatory problems have arisen because one of these 
two criteria has been violated. 

The papers in this book indicate that some thinkers pay attention to these 
criteria, others do not, and the quality of the papers is influenced accordingly. 
Kenneth Scott analyzes the dual banking system as a model of competition 
among regulatory agencies. By dual banking system, he means the system of 
alternative chartering at the state and national level and the system of divided 
regulation at the federal level that goes along with it. However, he only 
analyzes one-half of the problem. Competition among regulators lets banks 
prevent the imposition of unduly burdensome regulation by using the option 
of a change in chartering status to play one regulator off against another. 
Scott says this is good. But unless there are other checks on the regulators, the 
logical end of such a process is no regulation at all—that is, competition in 
laxity. But other checks do occur. Bank regulation and supervision is a 
technical business in which the best systems and procedures are far from 
obvious even to a trained professional. Regulators, like the rest of us, do not 
like failures, and they learn from each other’s mistakes. As in the mar- 
ketplace, alternatives in bureaucratic regulation promote innovation and effi- 
ciency, a form of “competition in excellence.” In their periodic oversight 
hearings on the condition of the banking system, the Congressional banking 
committees are not just assessing the credibility of the reports of one 
monolithic regulatory agency. They are comparing the performance of sev- 
eral agencies. It is not obvious that the system produces overregulation or 
underregulation. But there are inherent checks and balances against either 
tendency. 

Bank regulators have the dual responsibility of preventing extensive bank 
failures which would bring about unwanted contractions in the money supply 
and of insuring low-cost delivery of banking services to the public. This dual 
responsibility should be kept in mind when analyzing proposals for change. 
Scott presents an extremely complicated organizational chart to depict the 
structure of the system that regulates commercial banks. The chart displays 
the interrelationships between the Comptroller of the Currency, the Federal 
Reserve, the Federal Deposit Insurance Corporation, and the 50 state regu- 
latory agencies. The chart is incomplete. It does not include the Securities and 
Exchange Commission, the Federal Trade Commission, the Treasury De- 
partment, the Department of Housing and Urban Development, the Com- 
merce Department, the Labor Department, and the Energy Department— 
some of the other agencies that have gotten into the business of regulating 
banks. In addition, the Federal Home Loan Bank Board, the National Credit 
Union Administration, and the Farm Credit Administration—all omitted 
from the chart—are also regulators of deliverers of banking services and 
interact with the other bank regulators. The chart was constructed by Senator 
Proxmire’s staff for the purpose of justifying consolidation and simplification 
of federal bank regulation. If we wish to design bank regulatory systems for 
the administrative convenience of the regulators, the senator’s proposed 
reforms merit consideration. But if the purpose of the regulatory system is to 
insure continuous and efficient delivery of banking services in the mar- 
ketplace, the failures of the current system are not obvious. 


Failure to apply both criteria is illustrated in Roy Schotland’s paper on 
conflicts of interest. He relies extensively on anecdotal evidence and con- 
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structed situations with very little analysis of market structure or economic 
incentives. On page 125 he says that most conflicts occur because competition 
is not effective, an assertion that is not Justified with any evidence. On page 
128 he hastens to add that most conflict situations are not exploited. Could it 
be that competition is effective? He says that confidentiality requirements 
make “marketplace type shopping” unfeasible for doctors, lawyers, and in- 
vestment managers. No evidence is offered on the extent to which consumers 
actually do shop around for the services of such individuals. I make an 
attempt at it, and I do not think I am that unusual. Schotland makes an issue 
out of the conflict an investment banker incurs because of his obligation to 
“the buying public, to corporate issuers, and self-interest.” He does not say 
why this is any different from the problem faced by the president of a 
manufacturing firm who has to mediate the competing claims of customers, 
suppliers, and stockholders, or why it should require any more regulation. 

The paper by Robert Shay, “Consumer Protection in Credit Markets: An 
Analysis of Regulatory Reform,” is an excellent example of judicious applica- 
tion of both criteria. Shay uses the economic analysis of property rights to 
assess the regulatory reform movement affecting consumer credit markets. 
He develops an analytical framework and applies it to one limitation of a 
creditor's remedy to recover debt: the prohibition of wage assignments. He 
then compares his analysis of the problem to that used by the Federal Trade 
Commission (FTC). The comparison is fascinating and revealing. Although 
he does not do any empirical testing, he develops a precise way of getting 
answers to the questions needed to implement his theory. 

Shay believes a problem in consumer credit markets today is that the focus 
of bargaining is on rates and terms and not on the remedies and rights of 
creditors and borrowers. Rather than prohibit wage assignments, parties 
should be permitted to bargain over whether or not they are to be allowed in 
the contract. The author notes that when wage assignment provisions are 
allowed they are not uniformly found in all credit contracts. This suggests 
that when they are included, the reason is to lower the cost of credit to 
high-risk applicants. 

To assess the need for government interference Shay suggests using Dem- 
setz’s principle of compensation. The government should try to buy permis- 
sion to allow the activity from those who feel they would be harmed by 
allowing it, and it should try to buy permission to restrict the activity from 
those who feel they would benefit from allowing it. The lower-cost option 
should be purchased. 

Shay’s analysis indicates that the adversary relationship is not between 
debtor and creditor but between those who will be harmed if there is freedom 
to contract for remedies and those who will be harmed if there is not. The 
creditor is best viewed as an intermediary. Only a modest portion of the funds 
being lent are his own funds. Nevertheless, the creditors’ adjustments to 
governmental intervention determine who gets helped and who gets hurt. 
The need is for marginal benefit/cost comparison between two groups of 
debtors, not between the debtors vis-a-vis the creditors. The rather crude 
balancing process used by the FTC is between the creditors and the debtors. 

Two of the papers dealing with electronic funds transfer (EFT) propose 
radical policy changes without using either of the criteria mentioned above. 
Almarin Phillips claims that regulation and institutional arrangements— 
interest rate ceilings, deposit insurance, etc.—cause difficulties in the admin- 
istration of monetary policy. Undoubtedly this is true. But he then goes on to 
assert, in language phrased to make the dissenter feel antediluvian, that EFT 
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makes all of this substantially worse and will bring a crisis. He offers no 
evidence or analysis to support his conclusion, and I, for one, dissent from it. 
To be sure, many regulations cause difficulties for private bankers, central 
bankers, and the public. But it is the regulations that cause the problem, not 
the use of electronic forms of communication. 

Similarly, Alan Westin says that EFT creates the need for new statutes to 
spell out individuals’ rights. He says that when information is merged from a 
variety of sources and not used for one-time purposes a new trustee relation- 
ship is created. Of course, such information gathering is as old as banking 
itself. But when it is done electronically, Westin believes new laws are called 
for. He is not exactly sure about what the new laws and regulations should be. 
He proposes a decade or two of controlled experimental regulation. He gives 
no discussion of current abuses to support this recommendation. 

Arnold Heggestead does a survey of studies of bank competition and 
performance. Most of the articles surveyed, and the survey itself, suffer from 
a lack of application of the criteria mentioned here. This is best illustrated by 
Heggestead’s ambiguous and contradictory conclusion. He starts out by stat- 
ing that local banking markets are “more monopolistic than competitive.” He 
ends by stating five major areas where further empirical research is needed to 
clear up anomalies. 

If he had started out by analyzing the way markets operate he could have 
explained some of the anomalies himself. Interinstitutional competition is a 
fundamental fact of today’s marketplace. If one examines all of the various 
asset powers, liability powers, and fee income powers of commercial banks, 
there is only one privilege that commercial banks have that is not granted to at 
least one other financial institution. This is the power to offer demand 
deposits, a privilege whose value is rapidly being eroded by market innova- 
tions and changing product mixes. In 1947, commercial banks held 56.3 
percent of the assets of all financial institutions. In addition to thrift institu- 
tions, this latter category includes life insurance companies, private pension 
funds, state and local employee retirement funds, money market funds, and 
finance companies. In 1977 the corresponding figure was 39.6 percent. In 
1947, they held 81.7 percent of the total deposits of all depository institutions. 
In 1977, it was 59.3 percent. In 1947 commercial bank deposits were 56.2 
percent of the total liabilities of all financial institutions. In 1977 the corre- 
sponding figure was 32.4 percent. Demand deposits, the product for which 
commercial banks are sometimes said to have a “monopoly,” were 40.3 per- 
cent of total financial institution liabilities in 1947. In 1977, the corresponding 
figure was 10.9 percent. This is not the picture of a monopolistic industry. 
When they had reason to do so, bank customers were highly sophisticated at 
finding alternative ways of meeting their financial needs. 

Some of the other papers are good examples of applications of the criteria 
set forth here. Others are not. But enough has been said to illustrate the 
problem. 

The book suffers from a lack of discussion of the political aspects of 
regulation. I refer to the process whereby information is gathered through 
hearings and testimony, and laws are passed; and more information is 
gathered through hearings and testimony, and regulations are written. With 
increasing frequency, outsiders are having a hard time matching the laws with 
the regulations, and Congress and the regulators are pointing their fingers at 
each other. I have no final answer to this problem, but I will suggest two 
partial causes. First, the process inherently favors anecdotal evidence over 
econometric work and surveys which, because they are subject to 
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methodological criticisms, inevitably present a more balanced view of the 
world. Second, when the banking system works well, the bankers get the 
credit, not the regulators. When it goes awry, the regulators share a good 
portion of the blame. This inevitably promotes regulations designed to pro- 
tect inefficient banks, to the detriment of the customers of well-managed 
banks. 

P. MicHaEL Laus 
American Bankers Association 


New Challenges to the Role of Profit. Edited by BenjaMIN M. FRIEDMAN. 
Lexington, Mass.: D. C. Heath & Co., Lexington Books, 1978. Pp. vit 127. 
$14.95. 


In 1976, Professors Samuelson, Arrow, and Lundberg presented lectures in 
the third series of the John Diebold Lectures. Along with an introductory 
chapter by Professor Friedman and a concluding round-table discussion, 
these lectures comprise New Challenges to the Role of Profit. Samuelson gives a 
broad-brush treatment of the trend of profits, and intellectual opinion about 
profits, in the modern mixed economy. He also includes a brief history of 
profit theory. Arrow links criticism of profits to general attacks on private 
property, but he emphasizes the conceptual differences between profit, 
property income, and private property. Lundberg reports on the Swedish 
“middle way”—now dubbed “fund socialism”—and attempts to draw conclu- 
sions for Western countries generally. 

All things considered, the book is a disappointment from almost any per- 
spective. Even for a work of this genre, New Challenges lacks focus. Though 
edited by a perspicacious scholar who contributes an introductory essay, the 
book wanders. The subject remains unclear. The principal lectures were 
delivered by academics, with comments by a mixture of academics and 
nonacademics. At a minimum, one would have hoped for careful delineation 
of a profit concept, distinguishing profit from equilibrium returns to owners 
of capital, land, and other scarce resources (1.e., interest and economic rent). 
Though any number of allusions and some specific references are made to 
the distinction between interest (or ordinary profit), rent, and pure profit, 
these various incomes are constantly conflated. The analysis is further con- 
fused by unwarranted shifting in the lectures between static and dynamic 
analysis. For instance, in any essay containing a number of fruitful insights, 
Friedman observes that “real economies are dynamic, not stationary” (p. 4); 
he then notes the importance of this realization for our understanding of 
entrepreneurship, innovation, development, and industrial structure. This 
discussion is immediately followed, however, by a reference to the intertem- 
poral allocation role of profits (more properly an interest problem). Profits 
are then treated as accruing to owners of capital, thus entangling pure profits, 
rents, and interest. Shortly thereafter, Friedman considers the trade-off be- 
tween equity and efficiency—or distribution and allocation—arising in a 
profit system. Almost every contributor eventually refers to this trade-off. 
The discussions of the “trade-off” remind one of those concerning the “cruel 
choice” between inflation and unemployment. There may indeed be different 
combinations of income distributions and allocation efficiency, depending on 
the taxes and other constraints placed on profits, but it is by no means clear 
that any stable social choice set exists from which one could pick and choose 
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combinations. Further, if one considers the behavior of economies over time, 
then the question is not whether profits (however defined) contribute to 
income inequality, but whether the profit system produces more or less income 
equality compared to other, feasible economic systems. The latter question 
is not really addressed in this volume. 

Along with Arrow’s, Friedman’s is one of the better contributions to the 
book. The shifting from dynamic to static analysis and between profit con- 
cepts, even in the introductory chapter, is indicative of problems in the rest of 
the book. No fruitful analysis of “profit,” or of any other concept, is possible 
without a shared understanding of the nature of the problem being discussed. 
Such a shared understanding is lacking in New Challenges. The contributors 
could surely have adopted a number of profit concepts or definitions. But 
regardless of the concept or definition chosen, reasonable consistency in 
analysis is necessary. In this book there is almost no analysis of profits as the 
essential signaling and motivating force in a market economy. Despite oblig- 
atory references to Schumpeter, no approach treating profit as an income 
emerging from unexpected changes in economic data is seriously considered 
or analyzed. The work of Kirzner, identifying genuine profit as a return to 
superior entrepreneurship—alertness to altered conditions—is totally ig- 
nored (Kirzner 1973, 1979). Yet if the related theories of Kirzner, Mises, 
Schumpeter, and Knight are at all correct, then profit—not some fictitious 
auctioneer—is responsible for any tendency toward plan coordination and 
equilibrium in decentralized economies. Coordination is, of course, always 
incomplete, because change is always occurring. But without appropriable 
profits, the decentralized market system cannot work. 

The Kirznerian profit theory clarifies a number of conceptual problems. 
Profit, being a return to a special labor skill, is no longer conflated with 
interest, a return associated with capital accumulation. Nor is profit a rent or 
quasi rent accruing to a fixed or temporarily fixed factor, as in standard 
textbook analysis. Since profit disappears as quickly as it is publicly identified, 
and since it is not an economic rent, profit is not a taxable surplus. Arrow and 
Lundberg suggest that the ownership of property and the receipt of property 
income can be separated without serious allocational consequences. As Arrow 
put it: “This justification of the charging of price for the use of property is not 
necessarily an argument that the reward should go to the owner rather than 
to someone else” (p. 53). This line of argument is more plausible if profit is 
considered as a residual or rentlike income. In Kirzner’s analysis, however, 
profits are the driving force of economic activity. They are actively and 
consciously strived for. They neither accrue to passive resource owners nor 
are they a residual. Thus, profit income and private property are inextricably 
linked and intertwined. As Marxists have always understood, the fundamen- 
tal issue is the private property system and the very existence of markets. 

The latter point helps indicate the deficiency of the book. Most of the 
challenges to profits discussed are quite old. Yet the contributions do not 
build on the long literature on the subject (Arrow’s essay again is the excep- 
tion). Newer critiques of profit and the market system derive from “highbrow 
opinion” and not from popular concerns (to which the book is ostensibly 
addressed). The ordinary person is not concerned that market economies do 
not attain global maxima and are never in general equilibrium. (Whoever 
suggested they would be?) The ordinary citizen is becoming aware, however, 
of the tendency for governmental policy in a mixed economy to substitute a 
system of monopoly rents and arbitrary incomes for that of earned profits 
(see Hughes 1977). He may lack the theoretical background to explain the 
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process, but the man in the street is surprisingly on the mark in his judg- 
ments. Lundberg tells us that Western economies will, like Sweden’s, become 
more mixed (p. 91). If so, monopolization and socialization of earned income 
will become more widespread. 

In the mixed economy, or fund socialism, the criticism that profits do not 
motivate novation or lead to plan coordination changes from an erroneous 
critique to an essentially correct one. Inter alia, as measured profits are taxed 
at higher rates and supplanted by explicit and implicit subsidies, their moti- 
vational and allocational importance is diminished. Yet this process will re- 
main largely hidden if viewed from a perspective in which profit and eco- 
nomic rent cannot be clearly distinguished in an analytical sense. In the latter 
view, “profit” is a type of surplus that, at least within some range, can be 
altered with little impact on resource allocation. 

If, in fact, this book were to help focus attention on the distinctive operation 
of a mixed, as opposed to a market, economy, then it would have made a 
contribution. Were the book to help spark a public debate on this question, its 
faults could be overlooked. But if such a debate were to occur, it would 
probably be more in spite of than because of such efforts as this book. 


GERALD P. O’DRIsScOLL, JR. 
New York University 
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Economic Equality and Fertility in Developing Countries. By ROBERT REPETTO. 
Baltimore: Johns Hopkins University Press, 1979. Pp. 206. $14.50. 


This work is best viewed as a collection of pieces in which a model of the 
relationship between income and fertility is developed, applied, and in turn 
tested through these applications. The questions raised by the author are 
crucial ones for population policy, his answers are well reasoned, and if there 
is still room for disagreement regarding his policy conclusions it is neverthe- 
less much to his credit that the evidence and the issues are well laid out in the 
book itself. 

The book opens with a statement of its central theme: that fertility is a 
nonlinear negative function of income such that the marginal effects of an 
income change are greater for poor people (although perhaps not for the 
abject poor) than for the rich. It follows from this nonlinearity that a given 
change in income level would have little impact on fertility for families in the 
middle and upper end of the income distribution but might very well have a 
large impact on fertility for poor families. The policy implication is that 
redistribution of income from rich to poor would generate a decline in 
aggregate fertility and hence population growth. 
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Having stated this hypothesis in chapter 1, chapter 2 develops a cross- 
country model for testing its implications. Here it is necessary to assume that 
the basic structural relationship between fertility and income at the family 
level is essentially the same in every country (ceteris paribus). If so, it follows 
quite easily that any redistribution of income from the relatively rich to the 
poor, including transfers from rich countries to poor countries, would reduce 
aggregate fertility. It also follows that the relationship between aggregate 
fertility and income is not fully specified by the level of national income 
without some measure of its distribution. Chapters 3-5 are country studies 
establishing the robustness of the basic relationship. Chapter 6 concludes the 
book with an analysis of worldwide fertility along the lines suggested above. 

Each of the country studies uses some multivariate analysis based on 
household data as well as more aggregated descriptive statistics. In general, 
the regressions are less satisfying than the descriptive analyses. In part this 
may be because of difficulties obtaining data for the appropriate concepts and 
interpreting results based on imperfect data. Largely, however, it is because 
the separate country studies have not been well integrated into a coherent 
whole. Although the relationship between income and fertility is the only 
point at issue, a fairly complex country-specific model is needed in order to 
isolate that relationship. Despite the claim that chapter 2 develops a general 
model, each chapter contains its own model which is not clearly related to the 
“general” one. Moreover, the development of these models, the discussions of 
the data, and the interpretation of results are all too cursory to be satisfying, 
even though the limited purpose of this study may not warrant a full country 
analysis for each case. The author’s analysis is much more convincing when he 
is using his model to tell us a story about country experiences. We are thus left 
with the impression that he may know more than he has revealed in his 
country studies. 

The case study of Puerto Rico has relatively little descriptive analysis; it uses 
1970 census data to estimate the magnitude of an income effect on fertility for 
households at different income levels and to provide empirical confirmation 
of the nonlinear relationship. The second case study shows how such a 
relationship can be used to interpret the Korean experience: a major redis- 
tribution of income followed by both growing income and declining fertility. 
This is the most thorough of the case studies, and it is perhaps no coincidence 
that the regression model is limited to a few pages at the end. 

By far the most disturbing part of this book is the chapter entitled “Internal 
Policies for Income Distribution in Rural India.” The data from Matar Taluka 
(a rural area in India’s Gujarat state) are fascinating in that they resurvey an 
area for which data were also collected in 1930 and so permit an analysis of 
long-run changes in income distribution. These data are used as a point of 
departure for an analysis of the benefits of land reform; fertility comes up 
only once, in passing, when the author asserts that redistribution through 
land reform will have the desired income effect. The discussion of land 
reform which takes up fully half of this chapter is well presented and argued, 
but its relevance to the main theme or to anything else in this book is 
established tenuously at best. First, an income regression equation is esti- 
mated for Matar Taluka in which both human and nonhuman assets are 
included as independent variables, and it is found that variation in the 
nonhuman asset variables accounts for by far the largest portion of the 
explained variation in income. However, the author tells us that Matar Taluka 
is an agricultural region with large variation in the size of landholdings and 
that the survey year was a disastrous agricultural year for most of India; 
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ceteris paribus, this should lead us to expect a wider-than-usual distribution 
of income correlated with each farmer’s harvest within the region. At the 
same time, we are told why the human capital variables will not predict 
income: because there is no variation in schooling within the region and 
because age pertains to the household head instead of the individual earner. 
Thus the empirical results for Matar Taluka may tell us what was going on 
during the survey year but certainly cannot justify choosing land reform over 
human capital development as a long-term strategy. 

Repetto follows his analysis of the effects of land reform with some argu- 
ments why education cannot be effectively used to redistribute income 
through human capital stocks, even though that is in fact what he says has 
happened in Korea. His arguments often confuse inequality measures with 
the distribution of income. For example, since the initial distribution of 
schooling in Matar Taluka is very equal, the first-round impact of increasing 
the education level of younger cohorts will be to increase inequality of earn- 
ings, presumably because education improves earnings. Even ignoring poten- 
tial intergenerational transfers, this is surely a most extraordinary argument 
against progress, and one which I doubt would impress many low-income 
families. In the same vein, the following paragraph contains the suggestion 
that since improved education would affect only new entrants to the labor 
force, it would be decades before their numbers could be great enough to 
appreciably affect the average income. Then, having observed that an im- 
portant value of education is that it “prepares students . . . for jobs outside the 
traditional sector,” he says that land reform is needed because education “has 
more value for a future landholder than for a future agricultural laborer” (p. 
149). 

The irony of this analysis of the relative merits of human and nonhuman 
assets as sources of increased income for the poor is that although it is a small 
part of Repetto’s book it is strategically placed at a point where policy implica- 
tions are being drawn from research findings. The fact that the discussion of 
this point is at best superficial is especially puzzling in the light of the com- 
paratively careful reporting of evidence in the earlier chapters. Indeed, on 
the whole this book contains a useful perspective on the importance of income 
effects on fertility and provides convincing evidence that the improvement of 
incomes for the poor, especially in developing countries, is necessary and 
even perhaps sufficient as a condition for reduced aggregate fertility. 


CaRMEL U. CHIswick 
Unwersity of Illinois at Chicago Circle 


The Economics of the Performing Arts. By C. D. THRosspy and G. A. WITHERS. 
New York: St. Martin’s Press, 1979. Pp. 348. $27.50. 


Books on the economics of the performing arts are no longer a rarity. But this 
volume contains material quite different from that of its predecessors. It goes 
over and brings up-to-date the statistics on costs, revenues, and public sup- 
port which have been analyzed before. But, in addition, it seeks to go as far as 
the subject allows in applying the standard techniques economists have used 
to analyze other industries—adapting theoretical micro models and econ- 
ometric analysis to the problems of the arts. 
Predictably, the success of this venture is mixed. But this is no discredit to 
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the authors, who are obviously masters of both the subject area and the 
techniques, and who display consistent good judgment on issues of public 
policy, which is the book’s ultimate focus. : 

The volume deals with all of the subjects which one would expect it to 
consider, starting with the organizations which present performances (“the 
firms”), going on to consumer demand, industry structure, and then turning 
to public assistance to the arts. 

Most subjects are treated in four different ways. Each is discussed in 
general terms to describe the most pressing issues to which it gives rise. 
Second, extensive and often illuminating descriptive statistics are provided. 
Third, where possible, a formal theoretical model is constructed. Finally, an 
econometric model is formulated and the values of its parameters are esti- 
mated. 

One can quibble here and there with the more discursive portions of the 
book, but, by and large, they are accurate and illuminating. Of particular 
value is the extensive material comparing performing activity and its financ- 
ing in Australia, Canada, Great Britain, New Zealand, and the United States. 
The value of such international comparisons has often been asserted, but in 
the past the materials provided for the purpose have generally been quite 
spotty. The major conclusions that emerge from the comparisons should, 
however, hardly be surprising to anyone with some acquaintance with the 
field. They show remarkable similarity in the economic problems faced by the 
arts in the different countries, in the characteristics of the audiences, in 
the nature of the organizations, etc. For example, it is “striking” that “proportions 
of the population exposed to the performing arts are substantially higher for 
the middle-aged, high income, high education, professional, managerial and 
white-collar groups” (p. 96). About the only major difference seems to be the 
heavy reliance on private foundations and individuals for financial support in 
the United States, while elsewhere such income flows almost exclusively from 
government. Far more remarkable are the figures on the growth of govern- 
ment support in the five countries studied. For the 8 years 1969 through 
1976, in every one of the countries government assistance grew faster than 
price level, population, average hourly earnings of labor, GDP, and (with the 
exception of Canada) aggregate government expenditure. In Australia and 
the United States the degree by which government outlays on the arts out- 
paced these other indices was particularly remarkable. All of this is notewor- 
thy because in the past periods of inflation have been times of financial 
difficulty for the arts. As a matter of fact, more recent data for the United 
States indicate that government outlays on the arts have continued to rise in 
real terms, so that while the arts have experienced financial difficulties stem- 
ming from other sources, at least so far, and with some noteworthy excep- 
tions, real cuts in government funding have not been among them. 

The authors’ discussion of policy deals with such issues as justifications of 
assistance, the determination of the appropriate assistance levels and of the 
allocation of the funds by size of organization, art form, geographic location, 
etc. In discussing justifications for support the authors build their conclusions 
on the economists’ usual grounds for government intervention: externalities, 
redistribution, merit goods, option demand, etc. The discussion strikes one as 
sensible, though no surprising conclusions emerge. They reconfirm what 
economists had noted before—how difficult it is to provide a very strong 
justification for government support of the arts in these terms except perhaps 
on the merit goods argument. But, then, the term “merit good” merely 
becomes a formal designation for the unadorned value judgment that the arts 
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are good for society and therefore deserve financial support. In other words, 
the merit good approach is not really a justification for support—it merely 
invents a bit of terminology to designate the desire to do so. 

As already noted, the unique contribution of this book lies in its theoretical 
and econometric models, and these are, therefore, worth considering in a bit 
more detail. These models deal with a variety of subjects: consumer demand 
for leisure (following Buristam Linder); willingness to offer philanthropy; the 
theory of merit goods; propensity to provide philanthropy; and the behavior 
of the arts firm and its response to subsidy. The last of these is perhaps 
representative. Taking the nonprofit arts firms to be managed by utility 
maximizers constrained by a zero-loss condition, the authors proceed to 
formulate a utility function whose variables are attendance and quality, a 
production function which takes attendance to be a function of number of 
performances, number of distinct productions, capacity of auditorium, and 
quality. This is related to a cost function, whose variables are attendance and 
quality, and a demand function dependent on price and quality. From this, a 
standard partial-equilibrium diagram emerges, showing the equilibrium of 
the utility-maximizing firm and its relation to that of a profit maximizer. The 
conclusions which emerge are not particularly surprising—the utility max- 
imizer’s output (audience size) will be the larger, and its output will be in- 
creased by subsidy—even if the subsidy is a lump sum. 

The econometric models also cover a wide variety of subjects: theater cost 
functions, consumer demand functions, voter characteristics, magnitude of 
government assistance, and the effects of grants on attendance. The relatively 
simple forms taken by the relationships used in estimation are fully explained 
by the complexity of the subjects and scarcity of data. More difficult to explain 
is the rather persistent failure of the authors to provide interesting and 
testable hypotheses. It is as though the statistical calculations were undertaken 
as ends in themselves rather than as means to get at more fundamental issues. 
Nevertheless, some very interesting results emerge. For example, the study of 
the effects of grants indicates: 


Firstly, grants to drama have been more effective at the margin in 
securing greater attendances than have grants to opera, ballet and 
music. Secondly, the marginal unit of subsidy in drama has had a 
relatively higher effect on attendances at low levels of grants per 
company, declining as the levels of subsidy increased, although this 
diminishing productivity at the margin is not so evident in the case of 
the other art forms. 

It must be remembered that these results obtain at the margin in a 
situation where the existence of grants already accounts for a major 
part of attendances, especially in the case of opera, ballet and music. 
. .. In the case of theatre, however, the wider range of levels of 
activity and the more profound effects of a relatively small amount 
of support on a small company give greater scope for grants to lead 
to increased attendances in the lower ranges, although at the other 
end of the scale (i.e., for the major subsidized companies) the margi- 
nal unit of grants is less effective in increasing attendances for 
reasons similar to those discussed above in relation to opera, ballet 
and music. [Pp. 268-69] 


To summarize, this book is a very useful addition to the literature. It will 
prove indispensable to anyone conducting research on the subject. The au- 
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thors have given us up-to-date information, have contributed valuable in- 
sights, and have expanded the armory of weapons shown to be helpful. But 
their attempt to fit the analysis to the procrustean bed of standard economic 
techniques seems to have been pushed beyond the point of diminishing 


returns. 
WILLIAM J. BAUMOL 


Princeton and New York Universities 
HiLpA BAUMOL 
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The Deterrent Effect of Antitrust Enforcement 





Michael Kent Block and Frederick Carl Nold 


Hoover Institution 


Joseph Gregory Sidak 


Stanford Law School 


In this paper we formulate and test a model of collusive pricing in 
the presence of antitrust enforcement. We show that a cartel’s opti- 
mal price is likely to be neither the competitive price nor the price 
that the cartel would set in the absence of antitrust enforcement but 
rather an intermediate price that depends on the levels of antitrust 
enforcement efforts and penalties. Our empirical results reveal that 
increasing antitrust enforcement in the presence of a credible threat 
of large damage awards has the deterrent effect of reducing mark- 
ups in the bread industry. 


Soon after the passage of the Sherman Act, the Supreme Court 
determined that horizontal minimum price fixing was so inherently 
injurious to consumer welfare that it should be illegal per se. Hori- 
zontal collusion has since become a major focus of federal antitrust 
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assistance, and we gratefully acknowledge the helpful comments of Donald Baker, 
Wiliam Baxter, Gary Becker, Timothy Bresnahan, Aaron Director, George Hay, Peter 
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0071, and 79-NI-AX-0071 from the National Institute of Law Enforcement and Crimi- 
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enforcement.! Through the use of criminal and civil sanctions the 
Department of Justice (DOJ) has sought not only to remedy specific 
instances of price fixing, but also to achieve general deterrence of 
potential price fixing. 

This paper is the first systematic attempt to estimate the impact of 
antitrust enforcement on horizontal minimum price fixing. We de- 
velop a simple theoretical model of the collusive pricing decision and 
then, using data on the bread industry, assess empirically the deter- 
rent effect of public and private antitrust enforcement on the decision 
to collude. 


I. A Simple Model of Collusive Pricing in the Presence of 
Antitrust Enforcement 


A. Definitions and Assumptions 


We construct a relatively simple model to consider explicitly the effect 
of antitrust enforcement on the decision of firms within an industry to 
fix prices collusively. The model uses the following variables and 
definitions: 


p= price of output; 
Q = Q(p) = demand for the industry’s output; 
C = C(Q) = total cost of industry output, including a 
normal rate of return; 
c = C(Q)/Q = average cost; 
mc = 0C/0Q= marginal cost; 
\ = (p — mc)/mc = markup over marginal cost; 
m= total profits of colluders; 
y = level of enforcement efforts directed toward 
detecting horizontal collusion; 

F = t(p — mc)Q= the combined civil and criminal penalty for 
price fixing, where ¢ is the anticipated dam- 
age multiple; 

d=d(\,y)=the probability that a collusive pricing 
scheme will be detected. 


The definitions of F and d are essential to our analysis and warrant 
further discussion. 

Our specification of the penalty function, F = t(p — mc)Q, reflects 
that under current statutes a price-fixing conspiracy is subject to both 
criminal and civil sanctions? The Clayton Act’s private treble dam- 


* Posner (1970, p. 398, table 23) reports that 989 of the 1,551 Department of Justice 
antitrust cases between 1890 and 1969 contained charges of horizontal conspiracy. 

? The maximum criminal sanctions for price fixing are imprisonment for 3 years, a 
fine of $100,000 per individual, and a fine of $1 million per corporation. We do not 
consider injunctive remedies in this model. 
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age remedy is the most formidable civil sanction and clearly relates to 
markups, as our specification of F reflects. If criminal sanctions are 
small—as they usually are—then civil sanctions provide most of the 
sting of antitrust enforcement, and our specification of F accurately 
describes the true penalty for price fixing.? Moreover, if, when 
sentencing price fixers, a district court judge considers the size of the 
markup to be an index of the cartel’s perniciousness, then F is still 
reasonably descriptive on the rare occasions when criminal fines and 
sentences are significant relative to private treble damage awards. 
In the presence of uncertain antitrust enforcement the decision to 
collude depends not only on the magnitude of the penalty but also on 
the probability of detection, which we assume increases with the 
markup, [dd(A, y)]/(@y) > 0. The closer colluders come to the price 
that they would charge if no antitrust enforcement existed, the 
greater their chances of being discovered. Two observations support 
this hypothesis. First and most obvious, higher markups make cus- 
tomer complaints to the DOJ more likely.* Second, as Stigler (1968, 
pp. 268-70) has noted, a positive relationship probably exists between 
a conspiracy’s detectability and its ability to prevent cheating by indi- 
vidual colluders. For example, a joint-sales agency would assure a set 
of colluders strict adherence to a monopoly price, but so visible a 
collusive device would be nearly assured of detection. In general, we 
hypothesize that the more efficiently a collusive device produces mark- 
ups, the more likely it will be detected and the colluders convicted.® 
Hence, as we have assumed, the very technology of collusion makes it 
likely that the probability of detection increases with the markup.® 
We also assume that the DOJ never charges noncolluding firms: 


* A comparison of the magnitude of criminal and civil penalties appears in Sec. II. 
Though a substantial increase in potential criminal sanctions was legislated late in 1974 
under the Antitrust Procedures and Penalties Act, there is little evidence of its effec- 
tiveness in increasing realized sanctions (see Burnham 1978). 

* Hay and Kelley (1974) find that complaints by customers are the third most numer- 
ous method of detection among DOJ price-fixing cases. 

° Stigler (1968) presents some indirect evidence on this point. In a sample of DOJ 
cases he divides the types of collusion between what he considers efficient and inefficient 
collusion. The average time from inception of alleged collusion to complaint is 
significantly shorter for the efficient forms of collusion. 

§ Our study of the bread industry revealed, as tentative empirical evidence of the 
relationship between the probability of detection and the markup level, that the prob- 
ability of being investigated for price fixing was positively related to the markup level. 
We assumed that a lag exists between the price-fixing decision and its detection; then, 
using a one-period lagged value of our markup indicator as an explanatory variable, we 
estimated a logit model of the probability that DOJ would initiate an investigation for 
bread price fixing. The estimated intercept for our 208-observation sample was —2.49, 
and the estimated coefficient on the lagged-markup indicator was 5.69 with a standard 
error of 2.43. When the logit function is evaluated using the sample mean values, this 
coefficient implies that a 1-percentage-point increase in the markup increases the 
probability of an investigation by 5 percent. Sec. II elaborates on our specification of the 
markup and the nature of our sample. 
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d(0, y) = 0. We assume further that DOJ does not necessarily detect a 
pure cartel price with certainty—that is, d(A°, y) < 1, where A¢ is the 
markup that colluders would choose if no antitrust enforcement 
existed. 


B. Antitrust Enforcement and Optimal Collusion 


The objective of collusion is to set a price that maximizes expected 
cartel profits.? To model this price-fixing decision simply, we impose 
three additional assumptions. First, firms produce output under con- 
ditions of constant marginal and average cost so that mc = c. Second, 
all firms take part in the joint-pricing decision. Third, the most 
significant cost of any collusive device is its impact on the probability 
of detection. For example, the major cost of a joint-sales agency is its 
visibility, not its resource costs. This third assumption implies that, in 
the absence of antitrust enforcement, firms could duplicate perfectly 
the price and output levels of a monopoly.® 

Formally, the colluders set the price p by maximizing the expected 
value of their objective function, Ez. That is, the colluders 


max Ea (lead are ats (a) 


where 7, = (p — c)Q(p) is the profit level if the colluders avoid 
detection, 7, = [p — c(1 + At)]Q(p) is the profit level if the colluders 
are detected, and \ = (p — c)/c is the markup under constant costs. 
The necessary condition for an internal maximum is 


»= — SE map) + (QW) + (p — OQ = dt) = 0.) 

Equation (2) has a simple yet intriguing implication: If, as we have 
assumed, the markup significantly affects the probability of detection, 
then neither the competitive price (A = 0) nor the pure cartel price 
markup (A = A°) can satisfy equation (2).? Antitrust penalties, there- 
fore, do not necessarily eliminate price fixing. They are, however, 
likely to reduce the optimal markup.'° 


7 This assumes that the colluders are risk neutral. An analysis that allows for risk 
aversion is presented in Block, Nold, and Sidak (1978). The results of that analysis do 
not differ substantially from those presented here. 

® As long as we restrict our attention to industries with a history of conspiracy, 
considering intraindustry cheating would complicate the model without significantly 
adding to our understanding of collusive pricing in the presence of antitrust enforce- 
ment. 

3 ; ve are cartel markup is the markup A° satisfying the condition Q(p) + (p — c) 
Q'(p) = 0. 

10 If the detection probability were not sensitive to the markup level, then the decision 
to price fix would be a simple either/or proposition. In this case, as long as (1 — dt) > 0, 
the colluders would set the price at the pure cartel level. Clearly no price fixing would 
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As long as price fixing is a favorable gamble for some markup, (1 — 
dt) > O0—and our assumptions assure it will be—then risk-neutral 
colluders obviously will not choose the competitive price. Also, al- 
though the “unfairness” of the price-fixing gamble at the pure cartel 
markup, [1 — d(A‘, y)t] < 0, is sufficient to dissuade the colluders from 
choosing A = A‘, it is not necessary. In general, colluders will stop 
marking up when both [Q(p) + (p — c)Q'(p)] and (1 — dé) are positive. 
To deter colluders from choosing the full cartel markup is rather 
easy; to deter all collusive pricing in a conspiracy-prone industry is 
nearly impossible. 


C. The Effects of Changes in Enforcement Efforts and Penalties on 
Collusiwe Markups 


To assess the effect on markups of increases in enforcement efforts, 
we consider the effect of changes in y on the optimal X, or dA/dy. 
Totally differentiating equation (2) with respect to A, we obtain 


Or t 07d od ; 
aa ee 
where Z,, is the second-order derivative. Only in the rather perverse 
case where an increase in enforcement actually reduces the impact of 
the markup on the detection probability—that is, where (07d)/(0\0y) 
< 0—does an increase in enforcement have an ambiguous effect on 
the markup. In most cases of practical importance we expect that DOJ 
efforts designed to increase the probability of detection also reduce 
the markup. 

Increasing price-fixing penalties is often advocated as an efficient 
means to achieve deterrence.'! The effect of such an increase is 


expressed by the comparative static derivative: 


— = 4 NO + dlQ(p) +(p — JQ'(pyIt, (4) 


which indicates that a higher penalty unambiguously decreases the 
optimal markup. In summary, an increase in either the probability of 


occur if (1 — dt) < 0. Enforcement efforts would determine simply whether or not it 
paid for a cartel to fix prices. If it did, colluders would set markups at the full cartel 
level; if it did not, they would not collude. This model, in which the probability does not 
depend on changes in A, would make the empirical analysis somewhat more 
straightforward. However, such a formulation would be less descriptive of the actual 
situation confronting colluders. Formally, our assumptions do not preclude a corner 
solution at A = A°. If the detection probability displays a limited responsiveness to 
markups, then eq. (2) may not be satisfied as an equality. However, it is unlikely, given 
the technology of collusion and the existence of punitive sanctions for price fixing, i.e.,¢ 
> 1, for (1 — dt) to be positive at A = A°. 

li See Sec. III; see also U.S. Office of the President 1969, and Elzinga and Breit 1976. 
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detection or the penalty for price fixing can be expected in most cases 
of interest to reduce the collusive markup. 


II. Empirical Findings 


Our theoretical model suggests that increases in enforcement levels or 
penalties for price fixing generally reduce collusive markups. Though 
straightforward to derive, this implication is hardly trivial to test. 
Unfortunately, we do not have a set of national, or even regional, 
industries having identical products, costs, and demand conditions 
but varying levels of antitrust enforcement. We must assess the impact 
of antitrust enforcement in a more problematic environment. 


A. Choice of Industry 


To test the implications of our deterrence model, we analyzed the 
market for white pan bread, a homogeneous commodity not only 
regionally produced and consumed but also well represented among 
DOJ price-fixing cases. During our sample period, bread cases were 
the most common among DO}J’s food price-fixing cases.” In addition, 
the bread industry has well-recorded annual input and output prices 
compiled for selected cities by the Bureau of Labor Statistics (BLS) 
(see U.S. Department of Labor, Bureau of Labor Statistics 1964—76, 
1965-76).'8 Hence, we constructed a sample that enabled us to use 
both cross-sectional and temporal variations in product prices, costs, 
and antitrust enforcement. 


B. Estimating City-specifuc Markups 


Developing an indicator for price markups in the bread industry 
involved two steps. First, we examined a standard recipe for a loaf of 
white bread.!* We subtracted from each bread price observation the 


12 The FTC observed that the bread-baking and distribution industry has “the essen- 
tial characteristic of a conspiracy-prone industry—relatively few sellers in individual 
markets” (U.S. Federal Trade Commission 1967, p. 135). 

‘8 The BLS publications contained comparable price data for white bread for 20 
major cities. For 12 of these cities the data went back as far as 1955, although for the 
remaining eight they extended only to 1968. Input price data, though not so complete, 
proved adequate to construct a sample of 228 observations: 12 major cities for 
1964-76, and eight additional major cities for 1968-76. Cities included throughout the 
entire period (1964-76) were Baltimore, Boston, Chicago, Cleveland, Detroit, Los 
Angeles, New York, Philadelphia, Pittsburgh, St. Louis, San Francisco, and 
Washington, D.C. For 1966-76, we were able to add Atlanta, Cincinnati, Dallas, Hous- 
ton, Kansas City, Minneapolis, San Diego, and Seattle to the sample. 

14 The cost of ingredients (IC) for a 1-pound loaf of white bread is IC = .6350P;, + 
.0571P, + .0026P, + .035Py, where P, is price/pound of flour, Ps is price/pound of 


ANTITRUST ENFORCEMENT 435 


AB Ee 


EFFECT OF NONINGREDIENT INPUT Costs 
ON ADJUSTED BREAD Price (PADJ) 











Independent Variables 





PELEC .256* 
(4.62)+ 
PGAS 2.49 
(3.15) 
LABOR 1.07 
(7.14) 
Constant 7.05 
Observations (N) 228 
R? 49 
F-statistic (3,224) 72.93 





* The estimated coefficient. 
7 The value of the estimated coefficient divided by its estimated standard error. 


component attributable to the cost of the ingredients, IC, calculated 
from the recipe and from BLS ingredient prices. Formally, we 
defined the recipe-adjusted bread price, PADJ, as: 


PADJie = Pie — 1Cit, (5) 


where 7 is the city and ¢ the time index.'® Differences in profits and 
other noningredient input costs cause PADJ to vary across cities and 
over time. 

As a second step in developing a measure of markups, we estimated 
the amount of variation in PADJ attributable to variations in energy 
and labor costs. Table 1 contains the results of this analysis; PELEC 
and PGAS are measures of electricity and natural gas prices, and 
LABOR is a measure of wage rates for truck drivers. We assume this 
wage to be a proxy for general labor costs.’® The results in table 1 
suggest that variations in energy and labor input costs account for a 
reasonable amount of the variation in adjusted bread prices." 


sugar, Py the price/pound of cooking oil, and Py is the price/pound of dry milk. All 
prices are retail, except dry milk, which is wholesale. Cooking oil is used as a proxy for 
shortening (U.S. Executive Office of the President 1977, p. 7). 

‘5 The sources of price data were U.S. Department of Labor, Bureau of Labor 
Statistics (1965-76; 1969). 

'6 Specifications using food and kindred workers’ wages as well as alternative func- 
tional forms were also estimated. These different specifications yielded similar results 
(see Block et al. 1978). 

17 Of course, measures of goodness of fit for this regression substantially understate 
the importance of variations in input prices since the dependent variable, PADJ, is 
already net of the recipe or ingredient. Our entire procedure, taking account of both 
IC and the estimated cost function, accounts for approximately 82 percent of the 
variation in the price of bread (p). In general, prices for all significant inputs are 
included in cost functions. Our asymmetric treatment of input prices amounts to using 
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To construct a measure of the markup on bread, we then used the 
recipe costs and the results of this regression to estimate M, the 
markup indicator: 


M = [p — (NIC + IC)J(NIC + IC), (6) 


where all subscripts are suppressed and NIC refers to the fitted values 
generated by the equation in table 1.18 Equation (6) estimates the 
markup by first subtracting from the market price the sum of the 
known ingredient costs (IC) and an estimate of other noningredient 
costs (NIC), and then dividing this difference by estimated unit costs. 
The higher the residual as a proportion of unit costs, the higher the 
markup. 


C. Estimating the Effect of DO] Antitrust Enforcement 
on Collusive Markups 


Antitrust enforcement and penalties are only two of the many fac- 
tors that actually determine markup levels.'® Certainly, all factors 
that influence the price elasticity of demand also influence the mark- 
up level. Moreover, although we have assumed antitrust penalties to 
be the only cost of collusion, the resource cost of managing and 
policing a cartel also influences the optimal markup in the presence of 
antitrust enforcement. Assessing the effect of antitrust enforcement, 
therefore, requires a workable method of controlling for such outside 
influences. Influences generated by aspects of market structure that 
do not change rapidly—or at least that change significantly less 
rapidly than the antitrust variables—can be controlled by considering 
not the markup level but changes in the markup level.?° We therefore 
used first differences in the markup level, or 


AM Mi eV (7) 


in testing the effectiveness of antitrust enforcement.”! 


a priori information—the recipe for bread—to supplant estimation of some 
coefficients. It may be argued that this procedure increases the efficiency of the 
estimation, but it precludes substitution among raw materials and other inputs. 

18 Because a simple regression generates NIC, approximately half the observations 
for NIC exceed PADJ and thus generate a negative M. For our purposes, these 
negative values pose no problems. 

19 We structured our test of antitrust enforcement around markups rather than the 
traditional—but indirect—test of overall profitability or rate of return. One reason for 
using the markup was the very directness of the test. Also, the availability of regional 
and city data enabled us to conduct more powerful tests of the effectiveness of antitrust 
enforcement than could have been performed with available data on rates of return. 

*0 Controlling for these factors by actually assembling the relevant information on 
market structure for this industry at the city or SMSA level was not feasible. 

*1'This procedure also facilitated our use below of a straightforward measure of 
regional antitrust enforcement. 
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1. Measuring Detection Probabilities, DOJ Enforcement Efforts, 
and Remedial Effects 


We assume in our formal model of collusion that for any markup level 
the probability of detection is a function of DOJ enforcement efforts. 
Unfortunately, this straightforward theoretical proposition does not 
suggest a unique empirical counterpart for measuring either the 
probability of detection or DOJ’s enforcement efforts. This formula- 
tion does suggest, however, that for any markup level the probability 
of detection is related to the capacity of the Antitrust Division to 
initiate cases. To the extent that the Division’s litigation capacity 
relates to its expenditure level, the Division’s annual budget should 
provide an indirect measure of this enforcement capability.” 

A more direct measure of enforcement is simply DOJ price-fixing 
prosecutions. A price-fixing prosecution is rare enough for colluding 
firms in the affected industry to take special notice.”? We have as- 
sumed, therefore, that each price-fixing case that the DOJ brings 
against a bread producer increases, for any markup level, the prob- 
ability of prosecution perceived by other bread producers operating 
in the same DOJ region.” 

We had sufficient price data to estimate changes in markups for 
1965-76. We constructed a regional antitrust enforcement variable, 
DOJREG, for this period by setting the variable equal to one for each 
city within a region where the Antitrust Division filed an action that 
year—except for the city incurring the action—and by setting the 
variable equal to zero otherwise.2* In other words, DOJREG is a shift 


22 A positive and significant relationship exists between the Antitrust Division’s an- 
nual budget and the number of price-fixing cases brought by the Division. Using data 
on price-fixing cases during 1964-76 supplied by the Antitrust Division’s Economiic 
Policy Office, we estimated the relationship: 


DOJPF = —11.39 + .003BUDGET, 
(2.42) 


between the number of price-fixing cases brought annually by the Division (DOJPF) 
and its budget (BUDGET) measured in thousands of 1967 dollars (2.42 is the t-statistic). 

23 We can provide a formal rationale for this observation by assuming that colluders 
use Bayesian methods to estimate the probability that they will be apprehended in a 
particular period. In this formulation, whenever colluders are apprehended, colluders’ 
estimate of the probability of apprehension increases, and that increase is dramatic if 
their a priori distribution is diffuse and has a small mean. Also, after an initial impact, 
the effect of the case on the apprehension probability estimated by the colluders 
deteriorates if no new cases are brought. 

24 The Antitrust Division operated seven regional offices during the years of our 
sample: New York, Philadelphia, Cleveland, Chicago, San Francisco, Los Angeles, and 
Atlanta. The Antitrust Division established a Dallas office in 1976 and realigned the 
areas covered by the regional offices. This study used the earlier seven regions for 1976 
rather than realigning all regions for that single year. For coverage of regions, see U.S. 
Department of Justice, Antitrust Division (1973). 

25 This variable was constructed using data from the Commerce Clearing House 
(1955-75; 1966-76). A summary of those data appears in Block et al. (1978). Filing 
dates and other summary information were checked against a special listing of bread 
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variable designed to capture the changes in the perceived prob- 
abilities of cartel failure. Its form presumes that the colluders’ esti- 
mate of the probability that DOJ will initiate a case for collusion 
increases after a DOJ price-fixing action within the same region.”6 

The variable DOJREM measures the effect of the antitrust action in 
the city where the Antitrust Division actually prosecuted a bread 
producer. In constructing DOJREM, we assumed that the impact of 
the antitrust action on the firms specifically prosecuted would differ 
significantly from the impact on firms in other cities in the same 
region. Basically we assumed that, for strategic reasons, the timing of 
the reduction in markups by prosecuted firms would not coincide 
with the reduction made by firms in nonaffected cities. To capture 
this effect, we set DOJREM equal to one in a city | year after the 
Antitrust Division had filed an action there. 


2. Estimated Deterrent Effects 


In table 2 we present our estimates of the effect of DOJ enforcement 
variables on markups in the bread industry. Again, the dependent 
variable, AM, is the annual change in the markup on white bread for 
the cities in our sample. The results in table 2 strongly suggest a 
deterrent effect of DOJ enforcement efforts.”’ First, the coefficient on 


price-fixing cases prepared for us by the Economic Policy Office, Antitrust Division, 
U.S. Department of Justice, for the years 1963-76. 

26 An alternative hypothesis would be that, since DOJ has limited resources, a prose- 
cution in a region signals that an additional prosecution in the same region is unlikely. 
Hence a firm actually would reduce its estimated probability of detection after a DOJ 
case in its own region. This hypothesis is equivalent to predicting that drivers on a 
highway accelerate when they see a car already being ticketed. 

27 Our formulation, which posits that increases in the markup increase the detection 
probability, generates the structural equations: Ad = € + BAX + tABUDGET + 
nDOJREG and Ad = + aAd + OBDOJREM. Therefore, the reduced forms are 








Ad =o + 3 ABUDGET +5 1 DOJREG + [gg DOJREM 
= 10, 











Oy — aB 
and 
Av =p + — 2 ABUDGET # = DOIREG ae 
Peis [tg DOJREG + 7S, DOJREM. 


Our estimated reduced-form coefficients in table 2 are of the form (az)/(1 — a), 
(an)/(1 — af), and (@)/(1 — a). We are mainly interested in deducing the sign of a 
given the sign of the reduced-form coefficients. The usual approach would be to 
estimate the other reduced-form equation in the system which takes Ad to be a function 
of the exogenous variable. Unfortunately this approach cannot be applied directly to 
this problem. We do not have a proxy for Ad or measures of the detection probability at 
a regional level. Even at a national level measuring d is problematic. However, given our 
formulation of the determinants of detection, if we assume that DOJ expenditures have 
some efficacy—i.e., 7 > 0—then the condition necessary to deduce the nonpositivity of 
a from the reduced-form coefficients is that (1 — a@B) > 0. This condition is required for 
Walrasian stability in the collusion market and in conjunction with our reduced-form 
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TABLE 2 


ESTIMATED EFFECTS OF CHANGES IN DOJ ENFORCEMENT ON 
CHANGES IN MARKUPS IN THE BREAD INDUSTRY, 1965—76 











Independent 
Variables 
ABUDGET —.015* —.015 —.024 —.020 
(—2.74)t (—2.68) (—4.06) (—3.65) 
DOJREG 025 —.026 —.025 = AUP 
(—2.05) (32 i)) (—2.09) (—2.26) 
DOJREM —.046 — .046 —.044 
(=2.32) (—2.41) (2232) 
AFOODM +.058 ee 
(2.33) 
AGENM —.010 
(— 1.60) 
Constant On .013 .014 .017 
Re -055 .082 3, KON 
F-statistic 5.93 (2,205) 6.04 (3,204) 6.47 (4,203) 5.68 (4,203) 





Nore.—Each regression is based on 208 observations. 
* This coefficient is estimated per million dollars. 
+ The value of the estimated coefficient divided by its estimated standard error. 


our general measure of changes in enforcement capacity—the change 
in the real value of the Antitrust Division’s budget, ABUDGET—1is 
negative and significant. In other words, an increase in the enforce- 
ment capacity of the Antitrust Division appears to reduce markups on 
white bread. Second, the coefficient on our direct measure of DOJ’s 
enforcement activity (DOJREG) is negative and significant, suggesting 
that a price-fixing case against bakers in one city induces bakers in 
neighboring cities to reduce markups. This result comports both with 
our formal theoretical results and with the related conjecture in the 
Stigler report that “every victory” in seeking out price fixing “weakens 
the efficiency of undetected collusion.” Finally, the coefficient on 
DOJREM, the variable measuring the remedial effect of a price-fixing 
case, is negative and significant. Once discovered and prosecuted, 
colluders apparently “remedy” their price fixing by reducing their 
markups in the following year.” 


results implies that there is a deterrent effect, i.e., a < 0. In addition to this theoretical 
restriction on the sign of (1 — a@), we noted previously that there is a positive 
relationship between the total number of price-fixing cases brought by the Antitrust 
Division and the Division’s budget level. If we can assume that other collusion-prone 
industries made decisions about collusion in the same way as bread producers, then 
this aggregate regression gives information about the second set of reduced-form 
coefficients. Again, if 7 > 0, the sign of the reduced-form coefficient on ABUDGET 
indicates that (1 — a@f) is positive. 

28 See U.S. Office of the President, President Nixon’s Task Force on Productivity and 
Competition 1969. 

29 It is only necessary to assume that DOJ capacity is productive (7 > Q) to infer from 
our empirical results that n > 0, or that filing of a DOJ case in the region increases d. 
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The AFOODM and AGENM variables in table 2 control for general 
year-to-year variations in manufacturing markups that might not be 
adequately controlled by the first-difference procedure. The variable 
AFOODM is an annual series of the first differences in markups of 
food-and-kindred-product manufacturers; AGENM is an analogous 
series for all manufacturing firms.*° The coefficient on AFOODM is of 
the expected sign, suggesting that, holding enforcement constant, 
markups in the bread industry move in the same direction as markups 
in other food-related industries. Although using the general man- 
ufacturing markup (AGENM) in the equation does not alter the 
estimates of the coefficients on the enforcement variables, the sign of 
the coefficient on AGENM is curious. 


D. Class Actions and the Effect of Antitrust Enforcement on Markups 


Historically, trial judges have punished price fixers leniently.*t The 
cases collected for this study were no exception. All but two of the 17 
bread price-fixing cases between 1957 and 1976 involved nolo conten- 
dere pleas, and in only one case did a defendant serve an actual prison 
sentence; moreover, total fines as a percentage of the pretax profits of 
the colluding firms averaged only 7 percent.*” Neither imprisonment 
nor monetary penalties posed a credible threat to colluding firms. We 
hypothesized, therefore, that the deterrent effect of DOJ’s enforce- 
ment efforts came not from the threat of publicly imposed fines or 
imprisonment, but from the increased likelihood of an award of 
private treble damages to bread consumers or distributors.** 


30 Both series were computed from data reported by the Federal Trade Commission 
in Quarterly Financial Report for Manufacturing, Mining, and Trade Corporations, in U.S. 
Council of Economic Advisers, Economic Report of the President (annual). Of course, both 
markup measures are influenced by antitrust enforcement efforts to the extent that 
collusion is important in these manufacturing industries. 

31 E.g., during 1966—76 over 85 percent of all price fixers who were convicted or who 
pleaded nolo contendere did not serve prison terms (see Block et al. 1978). 

32 For the 16 criminal cases involving either a nolo plea or a conviction the actual 
criminal fines for price fixing were: 











Minimum Maximum Average 
Individuals ($) 1,031 9,966 4,025 
Firms ($) 2,930 50,638 20,690 
Fines, as % of defen- 
dants’ annual sales .09 1.0 col 





All fines are in 1976 dollars. Data on the ratio of fines to sales were derived from court 
documents relating to 13 cases (76 firms) during 1957-75. The overall ratio of net 
profits to sales was obtained from a study by the U.S. Executive Office of the President, 
Council on Wage and Price Stability (1977). 

33 Civil actions alleging horizontal price fixing are possible but, certainly in this 
industry, uncommon without a preceding criminal case. 
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Since bread price fixing generally causes a small injury to many 
individual distributors and consumers, private damage recovery is 
usually feasible only through a class-action suit. Class actions enable 
plaintiffs who are numerous, and whose independent damage claims 
are too insignificant to justify litigation, to maintain a single action for 
their aggregate damages.** District court documents revealed that 
settlements in class actions for price fixing in the bread industry were 
almost 10 times greater than government-imposed fines.?° 

To test the hypothesis that private class actions actually provided 
the effective penalty in price-fixing cases, we partitioned our sample 
into the periods before and after class actions became a credible threat 
in the bread industry. Since only one class action in our sample did 
not follow a DOJ case, we assumed in partitioning the sample that 


34 These actions, brought under Rule 23 of the Federal Rules of Civil Procedure, 
have become far more frequent since the Supreme Court amended that rule in 1966. 
The amended Rule 23 contributed to this greater frequency by changing the procedure 
for becoming a member of the class. Originally, Rule 23 required persons to “opt into” 
the class before they could benefit from the adjudication of the class’s cause of action; 
the amended rule instead presumed persons to be class members unless they affirma- 
tively “opted out” of the class. Although surprisingly few historical data exist on class 
actions, the data that do exist—particularly the information on docket entries for the 
Southern District of New York collected by the American College of Trial Lawyers’ 
Special Committee on Rule 23—suggest that the 1966 amendments to Rule 23 made 
the class action a much more attractive legal device (American College of Trial Lawyers 
1972). A recent alternative to the antitrust consumer class action is the parens patriae 
device by which a state attorney general may sue on behalf of the consumers in his state. 
For the purposes of deterrence, parens patriae actions are virtually identical to class 
actions (see Block et al. 1978). 

35 A search of Commerce Clearing House (1955-75; 1966-76), McLaughlin (1976), 
Newberg (1977), and the LEXIS computer file (1978) revealed seven major class actions 
for price fixing in the bread industry since 1966. According to district court documents, 
class-action settlements or damages during 1971-76 obtained for price fixing in the 
bread industry were: 











Minimum Maximum Average 
Cases ($) 1,197,810 6,100,000 1,998,646 
Firms ($) 39,562 1,220,000 293,919 
Damages as % of defen- 
dants’ annual sales Al 19.68 2.87 





All fines are in 1976 dollars. These figures are based on data for five of the seven 
recorded cases. One of the seven original class actions had no settlement or award since 
the district court refused to certify the class. No data at all were available on another 
case. All five cases were settled rather than litigated to judgment. Details of these class 
actions appear in Block et al. (1978, appendix table VI). Whereas average damages as a 
percentage of defendants’ sales were only 10 times average fines as a percentage of 
defendants’ sales, average damages per firm were almost 15 times average fines per 
firm. This occurred because the average sales per defendant differed significantly 
between the two samples: $5,795,271 for the fine calculations and $10,254,205 in the 
class-action calculations. 
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TABLE 3 


EsTIMATED EFFECT OF DOJ ENFORCEMENT AND CLAss ACTIONS 
ON MARKUPS IN THE BREAD INDUSTRY, 1965-76 











Independent 
Variables 
ABUDGETI11 —.018* 
(—.622)i, 
ABUDGET12 —.014 
(—2.68) 
ABUDGET21 ey —.002 
(—.158) 
ABUDGET22 ee —.019 
(=3.53) 
DOJREGI11 +.004 Esk 
(.160) 
DOJREG12 —.037 
(2.7.0) 
DOJREG21 Lee —.019 
(—1.03) 
DOJREG22 ae —.029 
(les) 
DOJREM — .046 —.042 
(—2.43) (—2.19) 
Constant .013 .013 
Observations (N) 208 208 
Re .0926 .1024 
F-statistic 4.13 (5,202) 4.60 (5,202) 





* This coefficient is estimated per million dollars. 
+ The value of the estimated coefficient divided by its estimated standard error. 


class actions affected primarily the penalty cost of detection, not the 
probability of detection.** We considered as partition dates the year 
that a district court first certified a class in a bread case (1970) and the 
year that the Administrative Office of the Courts began reporting 
class-action activity (1972). Before 1970 class actions probably were 
not a credible threat in the bread industry; by 1972 no doubt re- 
mained that class actions had become prevalent and important. To 
accomplish the actual partitioning of the sample, we split the en- 
forcement variables ABUDGET and DOJREG around 1970 and 1972. 
In table 3, ABUDGET11 and DOJREGI1 are ABUDGET and DOJ- 
REG for 1965-69, while ABUDGET12 and DOJREG12 are the vari- 
ables for 1970-76. The comparable variables partitioned around 


36 To determine when, after the 1966 amendments to the class-action provisions in 
the Federal Rules of Civil Procedure, the class action became a credible threat in the 
bread industry, we searched for both recorded and unrecorded class actions involving 
horizontal price fixing of bread products. None of the seven class actions we found was 
filed before 1968, actually certified before 1970, or settled before 1971. 
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1972 instead of 1970 are ABUDGET21, ABUDGET22, DOJ- 
REG21, DOJREG22.°" 

The estimates in table 3 are consistent with our hypothesis that class 
actions represent the effective penalty in price-fixing cases. This re- 
sult does not seem to depend upon whether we partition the en- 
forcement variables around the date of the first class certification or 
around the date of the first reporting of class-action activity by the 
Administrative Office of the Courts. For either partition, only in the 
latter period, when class actions represented a credible threat, did a 
significant deterrent effect result from either an increase in the Anti- 
trust Division’s resources or from the actual prosection of a horizon- 
tal price-fixing conspiracy.** 

The pattern in the bread industry is clear. A successful federal 
prosecution signals to consumers that a treble damage suit has be- 
come feasible. Private plaintiffs subsequently provide the effective 
penalty in the form of class actions for treble damages.*® For price 
fixing in this conspiracy-prone industry—and, we suspect, for price 
fixing in general—deterrence has been a product of both public and 
private enforcement efforts.* 


37 Formally, the definitions of the variables are: ABUDGET21 = ABUDGET, 
1966-71, zero otherwise; ABUDGET22 = ABUDGET, 1972-76, zero otherwise; 
DOJREG21 = DOJREG, 1966-71, zero otherwise; DOJREG22 = DOJREG, 1972-76, 
zero otherwise. 

38 While the estimates in table 3 do not include the markup controls, AFOODM and 
AGENM, regressions with these controls yielded similar results. Several factors possibly 
confound this analysis of the deterrent effect of class actions. First, the average criminal 
fine imposed by district courts might have increased over the period, or might simply 
have been larger when class actions became a relevant concern. To check for this 
possibly confounding influence, we estimated time trends for several measures of 
criminal fines. Overall, the evidence suggests that an increase in criminal penalties did 
not confound our results on the deterrent effect of class-action suits. Second, the early 
1970s included a period of price controls, and DOJREG12 and/or ABUDGET12 possibly 
proxied for the effect of these controls. We tested whether the enforcement variables 
performed this role by including directly in the markup regression a dummy variable 
for price controls. The results of this procedure suggest that, although price controls 
had a significant depressing effect on markups, the enforcement variables were not 
merely a proxy for price controls. Finally, we controlled for pure time trends associated 
with changes in demand or market structure within each city. This was accomplished by 
regressing the dependent variable (AM) against city dummies along with the deterrent 
variable. The effect of the enforcement variables remained significant and unchanged 
and nearly all of the city dummies were statistically insignificant (see Block et al. 1978). 

39 A nolo plea apparently is a sufficient signal. Although guilty verdicts are prima facie 
evidence in a treble damage suit and should induce private enforcement, they are rare 
in price-fixing cases. In our sample of 17 price-fixing cases, one case ended in an 
acquittal and another in a conviction; the other 15 involved nolo pleas. Yet even a 
government case that ends in a nolo plea signals to potential private plaintiffs that they 
probably would prevail in a damage suit against the alleged price-fixing conspiracy. In 
fact, all the reported class actions in our sample that followed a government action and 
received class certification eventually obtained a settlement award. 

40 The amendment of Rule 23 in 1966, of course, was the event which facilitated class 
actions, thereby increasing the power of the federal government to deter price fixing. 
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III. Conclusion 


In this paper we formulated and tested a simple model of collusive 
pricing in the presence of antitrust enforcement. We showed that if a 
cartel’s probability of detection increases with its markup, then the 
cartel’s optimal price is neither the competitive price nor, in most 
cases, the price that a cartel would charge in the absence of antitrust 
enforcement, but rather an intermediate price that depends on the 
levels of antitrust enforcement efforts and penalties. 

Our empirical results revealed that increasing DOJ’s enforcement 
capacity or filing a DOJ price-fixing complaint had the deterrent 
effect of reducing markups in the bread industry. We noted that 
government-imposed price-fixing penalties were trivial and found 
support for the proposition that the effective deterrent to price fixing 
was the credible threat of large damage awards to private class actions 
that followed DOJ’s case against the same conspiracy. Consequently, 
only after class actions became a credible private remedy did the 
Antitrust Division’s enforcement capacity or its filing of a bread 
price-fixing case deter collusion in the conspiracy-prone bread in- 
dustry. 


References 


American College of Trial Lawyers. Report and Recommendations of the Special 
Committee on Rule 23 of the Federal Rules of Civil Procedure. Los Angeles: 
American Coll. Trial Lawyers, 1972. 

Block, Michael K.; Nold, Frederick C.; and Sidak, J. Gregory. “The Deterrent 
Effect of Antitrust Enforcement: A Theoretical and Empirical Analysis.” 
Technical Report no. ISDDE-1-78, Center for Econometric Studies of the 
Justice System, Hoover Institution, Stanford Univ., December 1978. 

Burnham, David. “Tougher Penalties in Antitrust Cases.” New York Times 
(March 16, 1978). 

Commerce Clearing House. Trade Regulation Reporter. New York: Commerce 
Clearing House, various issues, 1955-75. 

. Trade Cases. New York: Commerce Clearing House, various issues, 
1966-76. 

Elzinga, Kenneth G., and Breit, William. The Antitrust Penalties: A Study in Law 
and Economics. New Haven, Conn.: Yale Univ. Press, 1976. ; 

Hay, George A., and Kelley, Daniel. “An Empirical Survey of Price Fixing 
Conspiracies.” J. Law and Econ. 17 (April 1974): 13-38. 

McLaughlin, Joseph T., ed. Federal Class Action Digest 1976. New York: Prac- 
tising Law Inst., 1976. 





Landmark Supreme Court decisions regarding class-action suits such as Eisen v. 
Carlisle and Jacquelin (417 U.S. 156 [1974]) and legislation such as the Hart-Scott- 
Rodino Antitrust Improvement Acts are likely to have similarly far-reaching effects. 


Some very preliminary attempts to assess the impact of these events on collusion appear 
in Block et al. (1978). 


ANTITRUST ENFORCEMENT 445 


Newberg, Herbert B. Newberg on Class Actions: A Manual for Group Litigation at 
the Federal and State Levels. New York: McGraw-Hill, 1977. 

Posner, Richard A. “A Statistical Study of Antitrust Enforcement.” J. Law and 
Econ. 13 (October 1970): 365-419. 

Sugler, George J. The Organization of Industry. Homewood, Il.: Irwin, 1968. 

U.S. Council of Economic Advisers. Economic Report of the President. 
Washington: Government Printing Office, annual. 

U.S. Department of Justice, Antitrust Division. “Memorandum to All Person- 
nel. Re: Organization and Operation of the Antitrust Division.” Directive 
no. 10-73, September 30, 1973. 

U.S. Department of Labor, Bureau of Labor Statistics. Wholesale Prices and 
Price Indexes. Washington: Bur. Labor Statis., various issues, 1964-76. 

. Estimated Retail Food Prices by City. Washington: Bur. Labor Statis., 

various issues, 1965—76. 

. Retail Prices of Food, 1964-68. Bulletin no. 1632. Washington: Bur. 
Labor Statis., 1969. 

U.S. Executive Office of the President, Council on Wage and Price Stability. 4 
Study of Bread Prices. Washington: Executive Office of the President, April 
1977. 

U.S. Federal Trade Commission. Economic Report on the Baking Industry. 
Washington: Federal Trade Commission, 1967. 

U.S. Office of the President, President Nixon’s Task Force on Productivity 
and Competition. Recommended Changes in Antitrust Policies. Washington: 
Government Printing Office, 1969. 








The Market Evaluation of Human Capital: 
The Case of Indentured Servitude 





David W. Galenson 


University of Chicago and California Institute of Technology 


This paper examines the market for human capital created by the 
institution of indentured servitude in colonial America. The inden- 
ture system allowed English emigrants to obtain passage to the 
colonies by selling claims on their future labor. With the size of the 
debt approximately equal for all emigrants, the length of the term 
for which a servant was bound is predicted to have varied inversely 
with expected productivity in the colonies. Analysis of two collections 
of contracts made in the seventeenth and eighteenth centuries sup- 
ports the prediction. Age, skill, and literacy were negatively related 
to length of indenture. Women received shorter terms than men at 
young ages, while servants bound for the West Indies and those 
bound in periods of high colonial demand for labor also received 
reductions. 


I. Introduction 


During the colonial period of American history, two institutions 
existed which provided for the explicit valuation of stocks of human 
capital in the market. One of these was slavery, under which blacks 
and their progeny were held in service for life. The other was in- 
dentured servitude, under which whites were bound to service for 
limited periods of time. 


I am grateful to Stanley Engerman for discussions of many of the issues treated in 
this paper and comments on an earlier draft. I would also like to thank Andrew Abel, 
Gary Becker, Lance Davis, Robert Fogel, Richard Freeman, Russell Menard, Frederic 
Mishkin, Melvin Reder, Sherwin Rosen, T. W. Schultz, George Stigler, and participants 
in seminars at Columbia University, the University of Chicago, the University of 
Illinois, the Newberry Library, the University of Southampton, and the 1980 Cliomet- 
rics Conference for their suggestions and comments. 


(Journal of Political Economy, 1981, vol. 89, no, 3] 
© 1981 by The University of Chicago. 0022-3808/81/8903-0007$01.50 


446 


EVALUATION OF HUMAN CAPITAL 447 


As a result of both its direct importance for the history of the 
nineteenth century and its indirect importance for the history of the 
twentieth, slavery has long been one of the central concerns of Ameri- 
can social and economic historians. However, in the labor markets of 
many regions, for substantial periods during the first half of the 
colonial era, indentured servitude was of greater quantitative impor- 
tance than slavery. In the eighteenth century the quantitative 
significance of indentured servitude declined in most of these areas, as 
slaves were substituted for servants in the sugar fields of the West 
Indies, in the tobacco fields of the Chesapeake colonies, and in the 
rice fields of South Carolina.? Yet indentured servitude nonetheless 
continued to perform an important role in the colonial labor market 
by providing skilled craftsmen and managers to the large plantations 
of the West Indies and the southern mainland colonies. 

The essential difference between servitude and slavery was that it 
was the labor of the servant, rather than the person, which was bought 
and sold. Because less attention has been devoted to the study of 
indentured servitude, we are less familiar with the history of the 
institution. An economic investigation of its functioning, based on the 
analysis of quantitative data generated by the operation of the market 
for servants, can therefore serve a dual purpose: It can provide 
evidence of how capital values were established when claims on the 
labor of humans for long periods were traded in the market, while at 
the same time increasing our understanding of the economic basis of 
white servitude in colonial America.4 


II. The Indenture Bargain 


Indentured servitude was a credit system under which human labor 
was leased. It functioned through two markets linked by a recruiting 
agent. In England, in the first market, a prospective servant signed a 


1 An example is Maryland, where a study of probate inventories has indicated that as 
late as 1674—79 the ratio of servants to slaves held in estates was 3.88 (Menard 19778, p. 
360). It should be pointed out that in some areas neither type of bound labor was 
quantitatively important. Slaves never accounted for more than 3 percent of New 
England’s total population in the colonial period, and the share of servants was 
probably of a similar magnitude (Greene 1942, chap. 3; Abbot Emerson Smith 1947, 
pp. 28-29). A generalization that held for a number of major colonial regions is that the 
share of bound workers in a region’s labor force tended to rise as one traveled south 
from New England, with progressively increasing shares in the middle colonies, the 
Chesapeake, South Carolina, and the West Indies. For evidence on population 
composition, see Greene and Harrington (1932), Sutherland (1936), and Wells (1975). 

2For accounts of this process in the West Indies, see Dunn (1973); for the 
Chesapeake, see Menard (1977b); for South Carolina, see Wood (1975). a 

3 On this change, see Gray (1958, 1:350) and Pares (1960, p. 19); for additional 
evidence see Galenson (1979a, chap. 19). a 

4 The present paper extends the analysis and empirical findings of Galenson (1977). 
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contract, or “indenture,” with a merchant, promising to serve the 
latter or his assignees in a particular colony for a given period under 
stated conditions. The servant was then transported to the specified 
colonial destination, where the merchant or his representative sold his 
contract to a colonial planter or farmer in the second market. In 
return for the commitment of his labor, the servant received passage 
to the designated colony, maintenance during the term of the con- 
tract, and certain freedom dues at its conclusion. Once signed, the 
indenture was negotiable property, and at any time before its conclu- 
sion the servant could be sold to a new master for the balance of his 
term. When the contract expired, the servant became free. The con- 
ditions of servitude were regulated by colonial statutes as well as by 
agreements written into the contracts. The terms of the contract were 
binding upon both master and servant. 

The indenture system normally operated within a context of com- 
petitive markets both in England and in the colonies. Servants were 
one important available backhaul cargo for English ships engaged in 
the trade for colonial sugar, tobacco, rice, and other agricultural 
staples.» Contemporary fare quotations indicate that the charges for 
passage from England to America were uniform at a given time for all 
individuals and did not vary by specific colonial destination.® 

The planter’s demand for indentured servants was based on his 
calculation of the discounted value of their net future earnings, after 
deducting the expected costs of the servant to him. The present value 
of the servant to the planter therefore depended upon the expected 
value of the servant’s output in each year of the contract; the expected 
cost of maintenance, supervision, and training for the servant during 
each year of the term; the discount rate; and the value of the freedom 
dues to be paid to the servant. These, or analogous variables, are the 
same considerations which enter into the derivation of a free worker's 
net age-wealth profile (Becker 1975, p. 223). Yet one critical differ- 
ence is that, whereas for free workers evidence on flows is used to 
calculate the values of capital stocks, in the case of servants these 
calculations were performed by planters, who based their demand for 
stocks of bound labor for fixed terms on their calculations involving 


> See Bruce 1907, 1:622; Smith 1947, p. 39; and Middleton 1953, pp. 145-56. 

6 While £6 was the fare cited in the early colonial period, after the middle of the 
seventeenth century £5 was the fare normally quoted for passage to all colonies. For 
references to quotations of passage charges from England to a number of colonies, see, 
e.g., John Smith (1624, pt. 1, p. 162); Purchas (1625, pt. 4, p. 1791); Bullock (1649, p. 
47); Taunton (January 4, 1670); Wilson (1682, p. 19), Littleton (1689, p. 17); Jeaffreson 
(1878, pt. 2, p. 102); Kingsbury (1906, pt. 1, pp. 277-78), Abbot Emerson Smith (1947, 
p. 35); and Alexander (1972, p. 45). A qualification to the statement in the text is 
possible seasonal variation in the cost of delivering servants to some colonies, discussed 
below. 
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the relevant flows. Therefore, like the slave market, the market for 
indentured labor produced capital values, and the flows underlying 
these must be inferred. 

The institutional arrangements which provided for the ownership 
of human capital for discrete periods produced potential differences 
between the patterns of human-capital values under indentured ser- 
vitude and those implicit in the operation of free labor markets. An 
advantage of the indenture system for the planter, relative to the 
hiring of free workers, was the greater control it gave him over 
the servant's time and effort once the bargain was made. Because the 
servant would have been compensated for his loss of freedom, in the 
absence of uncertainty, the price of indentured human capital would 
have been higher than that of free workers.’ However, in practice, 
uncertainty makes the direction of the inequality between these prices 
unpredictable without additional information, for it cannot be deter- 
mined a priori whether the insurance value of the contracts was 
normally greater to the planter or the servant.® 

All servants who migrated to America incurred debts of similar 
value. As noted above, passage charges were uniform for all servants, 
and maintenance costs and freedom dues varied little across indi- 
viduals. As a result, every servant contract was a promise to repay 
approximately the same sum of money. Therefore, the present dis- 
counted value of every servant's contract should have been approxi- 
mately the same at the time of binding. Since the productivity of 
servants varied, the conditions of their indentures had to vary. The 
higher ther servant’s marginal-value product above his maintenance, 
the faster he could repay the loan made to him, and the shorter the 
term of the contract. The length of indenture across servants 
should, therefore, have been negatively correlated with individual 
productivity or, equivalently, with the market valuation of the current 
flow of income generated by the individual's stock of human capital. 


7 This assumes a solution to the problem of monitoring the servant’s work in which 
the productivity gains from the master’s control over the servant's time and effort were 
not offset by shirking. 

8 In a world in which futures contracts for free wage labor could be made with certain 
fulfillment, the present value of an indenture for a given number of years and of a 
series of contracts for hires for the same years would differ only by a premium which 
would reimburse the servant for the loss of freedom resulting from his residence in the 
master’s household and due to other legal provisions governing servitude. However, in 
the absence of these guaranteed futures contracts, in some cases the master might also 
have been willing to pay more for an indenture because of the assurance it gave him of 
labor supply in peak seasons or future years; similarly, workers might sometimes have 
been willing to accept lower implicit wages in return for the guarantee of employment 
the indenture represented. The relative cost of this insurance to master and servant 
could vary, and as a result the relation between the implicit wage paid to servants and 
the hire rate for free workers is indeterminate. 
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Strictly, if costs had been precisely the same for all servants, all 
should have been bound on conditions that would yield the same 
expected price on arrival in the colonies. This, of course, does not 
imply that no variation should have occurred in the realized auction 
prices of servants in the colonies, for random disturbances between 
the time of binding and auction—such as illnesses of servants during 
the voyage or changes in the price of colonial outputs—could have 
produced differences between actual and expected prices. What the 
analysis does imply, however, is that if the market for contracts of 
servitude was efficient, variation in servants’ auction prices should 
have been uncorrelated with all the characteristics of the servants 
which were known at the time of making the indenture bargain.* 

Very little evidence of the prices of newly arrived servants at colo- 
nial auctions has survived: From the entire colonial period the only 
known records of auctions of English servants come from the ac- 
counts of two ships, the Tristam and Jeane, which sold 68 servants in 
Virginia in late 1636 or 1637, and the Abraham, which sold 56 servants 
in Barbados in January 1637 (London, Public Record Office, 1636, 
1637). The listings of the amounts paid for the servants show that the 
median and modal prices of both men and women were identical— 
500 pounds of tobacco—in both auctions. Thus, although as will be 
seen both sex and colonial destination had a significant effect on the 
length of servants’ indentures, the available evidence, though limited, 
suggests that neither may have affected the initial colonial auction 
prices for the servants. This is consistent with the hypothesis that the 
variable dimensions of the contract were adjusted for the servants’ 
characteristics so as to make the expected auction prices of all servants 
the same. 

The relation between auction prices of servants and the simple cost 
of passage is also of interest. The modal price of 500 pounds of tobacco 
observed in the auctions was 11 percent greater than the fare that two 
free passengers each paid for passage to Virginia on the Trstam and 
Jeane on the same voyage as the servants and 4 percent above the fare 
quoted elsewhere for passage from England to Maryland in 1638.!° 


* This statement neglects one potential element of cost. The marginal cost of delivery, 
and therefore auction prices, would have included any costs the merchant incurred in 
recruiting servants in England. These could clearly vary across individual servants, 
producing differences in expected auction prices. These costs could have been corre- 
lated with individual productivity, as in some instances skilled servants were given 
lump-sum payments at the time of binding. For some evidence of this practice, see the 
analysis of the Middlesex sample in the Appendix. 

10 The two free passengers each paid 450 pounds of tobacco (London, Public Record 
Office 1637). For the 1637 Chesapeake farm price of tobacco, see Menard (1975, p. 
475). The fare quotation, of £6 sterling per person, is in Maryland Historical Society 
(1889, p. 206). 
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That in these cases the typical auction prices tended to be higher than 
the fare may have been due to the cost of recruiting servants, or to the 
existence of a premium received by merchants for bearing the risk of 
servant mortality on the ocean voyage. 

Related evidence on the nature of the labor-market equilibrium in 
the colonies which induced flows of labor from England is available 
from surviving valuations of indentured servants recorded in Mary- 
land for probate courts. The mean price of 28 male servants with 4 
remaining years of servitude recorded on Maryland’s lower western 
shore during 1704—57 was £8.95, with a median of £9 and a mode of 
£10; while the mean price of 19 females with 4 years remaining was 
£7.75, with a median of £8 and a mode of £10." The typical marginal 
cost to an English merchant of delivering servants to the colonies 
cannot be estimated precisely but probably fell within a range 
bounded at the lower end by £5, the usual cost of passage, and at the 
upper end by the £10 estimated by Abbot Emerson Smith as the 
maximum expense of delivery.’ The evidence of the probate 
valuations—although again limited in quantity—suggests that the 
central tendency of the colonial price of servants did fall within this 
range and, therefore, offers additional support for the hypothesis 
that the price of a servant’s indenture in the colonies was equal to the 
marginal cost of delivering labor there from England. 

A number of other general considerations relating to the adjust- 
ment of the bargain deserve mention. One is the role of mortality. 
The smaller the probability of a servant’s serving a given year of his 
term, the lower his expected net earnings and, ceteris paribus, the less 
favorable the terms of the contract he would be able to sign. This is 
true whether the mortality in question is that during the Atlantic 
crossing, when the merchant bore the risk, or that after arrival in the 
colonies, when the planter had purchased the contract and assumed 
the risk.'% 

A second factor with a similar effect was the possibility of a servant 
failing to serve out his term for a reason other than death, principally 


1 | am grateful to the St. Mary’s City Commission, Annapolis, Maryland, for tran- 
scriptions of the probate price quotations, from Maryland Hall of Records. Most of 
the quotations are from the early decades of the period; on devaluation of Maryland 
currency and its reflection in probate valuations, see Main (1972, pp. 14-18) and 
McCusker (1978, pp. 189-204). } ; 

2 Smith 1947, p. 37. A precise analysis would include a positive premium in the 
probate valuations, for these servants had normally been “seasoned,” i.e., had spent a 
year in the new colonial disease environment, and consequently had a longer life 
expectancy than the new arrivals, ceteris paribus. Ae 

18 Mortality rates differed among colonies, and the effects of this on the length of 
indenture will be discussed below. The assumption here will be that mortality rates 
among servants both during passage and in the colonies were not systematically related 
to individual productivity. 
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running away. All the colonies enacted legislation intended to dis- 
courage servants from running away; of these some were preventive 
measures and others punitive. While some colonies provided for 
corporal—and even capital—punishment for runaways, the most 
common penalty was extension of the servant’s contract by some 
multiple of the time he was absent.'* One of the provisions of the 
contract, the servant’s freedom dues, constituted a nonvested pension 
and, therefore, also acted to discourage servants from running away. 
While the form and worth of the dues varied across colonies and over 
time, they were often of substantial value and could have constituted a 
significant deterrent to servants who considered escaping from their 
masters.!° 

Servants were not allowed to marry during their terms. Since, by 
English practice, the expense of raising the illegitimate children of 
servants fell on the county, colonial legislation provided that the 
father should be discovered by oath of the mother and that he should 
reimburse the county for the expense of raising the child until it could 
be bound out to work. Since servants could not normally pay this sum, 
either the master would pay it and the servant’s term would be 
extended, or the servant would be bound over to the county for an 
additional term after the conclusion of his normal term, to be sold for 
the necessary amount. The mother’s term was also extended to reim- 
burse the master for her lost working time. 


III. Empirical Results 


The principal sources of quantitative evidence bearing on this market 
for human capital are servant contracts recorded and held in English 


‘* For references to laws relating to runaways, see Semmes (1938, pp. 116-18) and 
Smith (1947, pp. 264-70). The strong terms of this colonial legislation were an impor- 
tant element in the indenture system’s success, and some authors have argued that 
changes in the legal provisions for the enforcement of contracts were central to its 
decline in the early nineteenth century; see, e.g., Geiser 1901, p. 42. Similarly, the lack 
of success of the attempt to revive a contract labor system during 1864-85 has been 
attributed in part to the high cost of enforcing contracts through civil action; see 
Erickson 1957, pp. 46-48. 

® On freedom dues, see Smith (1947, pp. 238-41) and Heavner (1976, pp. 50-51). 
For a discussion of an analogous provision, nonvested pensions as a firm’s insurance 
against quits, see Becker (1975, p. 34). The nature of freedom dues has some implica- 
tions for other dimensions of the servant contract. The dues were specified by colonial 
law and were equal for all servants in a given colony. Therefore, the discounted cost of 
the dues to the planter at the time he purchased a contract varied inversely with the 
length of the contract. One effect of the fixed nominal value of the freedom dues may, 
therefore, have been to reduce the amount of variation in the length and other 
dimensions of the contracts, since the variations in the latter were intended to equalize 
the net present values of all contracts. This point should not obscure another basic 
effect of the existence of freedom dues, for, ceteris paribus, they raised the cost of 
servants and therefore tended to lengthen the term of indenture. 


EVALUATION OF HUMAN CAPITAL 453 


courts. These were made both to protect servants from kidnapping 
and to protect merchants from false charges of kidnapping. The two 
largest known surviving collections will be analyzed here; the earlier 
set, recorded in Middlesex during 1683-84, covers a total of 812 
individuals, while the later, recorded in London during 1718-59, 
covers 3,187 servants. 

The principal variable analyzed here will be the length of the 
indenture. Furthermore, this analysis will be done only for the minors 
in both samples; these comprise 22 percent of the servants with 
known ages in the earlier and 67 percent in the later sample. The 
selection of the variable to be analyzed and of the minors follows from 
consideration of the evidence of the contracts. It is clear that 4 years 
was the normal duration of an adult’s indenture in both samples, yet 
for a combination of legal and clerical reasons the full set of condi- 
tions of adults’ servitude do not appear to have been recorded in 
either set of indentures. For minors, the full conditions do appear to 
have been recorded, and the duration of the contract appears to have 
been the chief variable dimension of the contracts. Both variations in 
freedom dues and restrictions on the servants’ occupations in the 
colonies were rare. Cash payments made to adults were not generally 
recorded in the later sample, but the contracts of the minors, on 
which they do appear to be recorded conscientiously, show that 
fewer than 6 percent received cash payments. Some cash payments 
are recorded for both minors and adults in the earlier sample, and 
although it is uncertain whether payments were recorded in all cases 
in which they were made, less than 7 percent of the contracts contain 
such entries. A notable feature of the payments recorded in both 
samples is that virtually all—95 percent in the earlier and 97 percent 
in the later sample—were made to servants bound for 4 years. Insofar 
as payments were made (or promised) and recorded, analysis of the 
characteristics of servants who received them tends to reinforce 
the results obtained from the analysis of the length of indenture.’® 
The evidence of both samples, therefore, indicates that for minors the 
greatest variation in the conditions of indenture occurred in the 
duration of the term of servitude, while for adults this was not the 
case, as 4 years was both the standard term for adults and normally 
the minimum term assigned.” For adults, cash payments were appar- 


6 See the Appendix. 

17 Terms of less than 4 years do occur, but they appear to have been rare after the 
midseventeenth century; thus, they account for only 0.5 percent of all indentures of 
known length in the Middlesex registrations of 1683-84 and 1.2 percent of those in the 
London registrations of 1718-59. The reasons for this are not known. Four years may 
have been the term required at most times and places for the average adult to repay the 
cost of passage out of his net earnings, but it is unclear why shorter terms were not 
more often given to highly skilled servants like the accountant James Corss, whom 
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ently substituted for reductions in the length of servitude below 4 
years. 

Earlier analysis suggested that a servant's term of indenture would 
be inversely related to the market valuation of his stock of human 
capital. This index of the servant’s human capital can be related to a 
number of observable characteristics potentially relevant to the de- 
termination of the present value of that stock. When this is done by 
multiple-regression analysis, the estimating equation differs from the 
common hedonic method only in the use of an index for price. The 
estimated coefficients of the independent variables age, sex, literacy, 
and occupation can be interpreted as the marginal prices paid for 
servants’ characteristics in units of the index, while those of desti- 
nations represent compensating differentials among regions.'* 

Table 1 shows a number of the basic relationships underlying the 
market valuation of the servants. In both samples, the length of 
indenture was negatively related to both age and skill: With other 
things equal, servants with skilled occupations and those able to sign 
received shorter terms. Women received shorter terms than men, 
ceteris paribus, and servants bound for the West Indies received 
shorter terms than those bound for the North American mainland. 

A comparison of the estimated coefficients of the sex variable across 
samples indicates that, on average, women received considerably 
larger reductions in their terms in the 1680s than in the eighteenth 
century. This decline in the premium for females is not surprising in 
view of the generally declining colonial sex ratios during this period, 
for while women were preferred for some kinds of household work 
and some types of farming, their increasing relative availability in 
most colonies would be expected to lower the size of their wage 
differentials.!9 

Table 2 provides a more detailed analysis of the length of indenture 
for the later sample, allowing separate age profiles of length of inden- 
ture by sex and skill. It reveals that there was a tendency for women 
to receive indentures from 5 to 15 percent shorter than those of men 
through the age of 17, while for servants aged 18-20 there was no 


Walter Tullideph sent to the manager of his plantation in Antigua in 1759 with a note 
stating that he “hath bound himself to serve me four years agreeable to the Laws of 
Antigua, but as he is 22 years of Age, he thought it hard to serve so long and for that 
reason, I have given him a Certificate that he is to be absolved from the last year’s 
Service” (Tullideph 1759, vol. 3). That planters preferred to substitute salaries for 
reductions of the term below 4 years suggests the possibility that fixed costs of hiring 
and/or a desire to capture the returns from a servant’s general training in the colony 
may have been important considerations. 

'S On the interpretation of coefficients in hedonic price indexes, see Rosen (1974, pp. 
34-35). 

'® ‘The difference between samples in the sex coefficients in table 1 is significant at the 
-O1 level. On declining colonial sex ratios, see Wells (1975, pp. 156, 219, 244). 


TABLE 1 


ESTIMATED REGRESSION COEFFICIENTS, MIDDLESEX AND LONDON SAMPLES 














MIDDLESEX, 1683-84 Lonpbon, 1718-59 
Estimated Standard Estimated Standard 
INDEPENDENT VARIABLE Coefficient Error Coefficient Error 
Age (years):@ 
Total sample: 

Less than 15 2.655 .385 2.749 134 

15 2.201 .400 2.147 .080 

16 1.457 304 1.304 .068 

17. .893 367 .728 .062 

18 .174 .270 331 .055 

19 .738 306 .169 .050 
Sex? — 1.484 .207 rl .073 
Literacy° OD 2A SOG2 037 
Date? ae — —.0060 .0023 
Trade® Ou, 445 as — 
Farmer‘ ae hs —.313 .074 
Laborer a ae —.146 .079 
Services’ a Pion —.348 .066 
Metal and construction® ae BS —.320 .067 
Clothing and textiles! sre 6 Days Solis 060 
Antigua) rT) .812 —.403 SLO) 
Barbados —.553 .274 —.176 .154 
Jamaica SeouS 462 230 .060 
Other West Indies* —.401 1.094 —.479 .088 
Maryland .203 .209 306 .059 
Virginia ae nee le .073 
Other mainland! —.389 .673 .050 Ale) 
Constant 5.227 Mek 4.665 re 
R2 .555 fairs 539 
F 12.87 ee 112.82 
n 171 ee 2,049 





Source.—Data used are all from records of minors (age less than 21). Middlesex, 1683-84: London, Greater 
London Record Office; Nicholson (1965); Wareing (1976). London, 1718-59: London, Corporation of London 
Records Office; Kaminkow and Kaminkow (1964); Galenson (1977a). 

Note.—Dependent variable = number of years indentured; method of estimation used is ordinary least squares 
in tables 1 and 2. 

For age variable, indicated age = 1; zero class = age 20. 

> Male = 0, female = 1. 

© Marked = 0, signed = 1. 

4 Date entered as final two digits of year of registration. 

© Trade = 0 for laborers and no recorded occupations; trade = 1 for all other men’s occupations. 

‘For all occupational variables, indicated occupation(s) = 1; zero class = no recorded occupation. “Farmer” 
includes husbandman, plowman, etc. 

® Includes accountant, barber, surgeon, etc. 

» Includes blacksmith, carpenter, cooper, mason, etc. 

! Includes clothier, tailor, weaver, etc. 

) For all destination variables, for Middlesex sample, zero class = Virginia; for London sample, zero class = 
Pennsylvania. 

* Includes Nevis, St. Christopher, etc. 

' Includes Carolina, New York, etc. 
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TABLE 2 


EsTiIMATED REGRESSION COEFFICIENTS, LONDON SAMPLE, 1718-59 








Independent Estimated Standard 
Variable Coefficient Error 





Age (years): 
Total sample: 


Less than 15 2.976 plas 
15 21378 .092 
16 1.542 084 
es .959 .084 
18 473 .079 
19 .260 .083 
Women, age: 
Less than 15 — 1.034 460 
15 —.472 390 
16 —.969 .228 
ley —.302 161 
18 .041 .159 
19 .090 135 
20 .198 .156 
Trade, age:” 
15 — 1.496 : 392 
16 — .884 .206 
17 —.502 slats 
18 —.275 .085 
19 —.224 .073 
20 —103 .076 
Literacy —.076 .036 
Date —.0093 .0024 
Antigua® 200 235 
Barbados —.005 .268 
Jamaica —.084 .209 
Other West Indies =.363 ell 
Maryland .194 .063 
Other mainland —.039 .072 
February? —.100 .092 
March eal aS 
April .156 Salo 
May —.095 .164 
June 43 | Bills 
July SaOG plats, 
August 8816) .098 
September See .096 
October —.400 LO) 
November 225) .100 
December .014 .097 
Sugar® —.0162 .0074 
Constant 4.830 
R? .566 
F 52.06 
n 2,044 





Source.—London, Corporation of London Records Office; Kaminkow and Kaminkow (1964); Galenson (197 7a). 

Nores.—See table 1 for variables not defined here. 

@ Female age interactions: indicated variable = 1 for women of given age. 

» Trade age interactions: indicated variable = 1 for men of given age who recorded a trade (as defined in table 1). 

* For destinations, zero class = Virginia. 

4 For months, zero class = January. Separate interaction terms between West Indian destination and month of 
registration were included in the equation, but their coefficients were generally small in value and are not reported. 

* Sugar = average annual price of muscovado sugar in London, in shillings per hundredweight, lagged 1 year, for 
servants bound for West Indies (Sheridan [1974, pp. 496-97], with linear interpolation for 1717-20 and 1727). 
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difference in the length of term by sex. Ina suggestive parallel result, 
Robert Fogel and Stanley Engerman found that, excluding the value 
of childbearing, the net earnings of female slaves were greater than 
those of men prior to the age of 18, apparently due to the more rapid 
physical maturation of women.”° The sex differentials in the terms of 
young indentured servants might have resulted from the same 
source. 

The results of table 1 indicate that premia for skills were reflected 
in the length of servants’ indentures. The more detailed specification 
of the occupations presented in table 1 for the London sample indi- 
cates that the marginal premium paid for servants in each of four 
occupational categories—farmers, services, metal and construction 
crafts, and clothing and textile trades—was virtually the same.?! Un- 
skilled laborers received terms longer than servants with skilled occu- 
pations but shorter than those with no recorded occupations;” the 
latter result may indicate that some premium was paid for the labor- 
ers’ work experience. 

Economists have devoted considerable attention to the analysis of 
the relationship between productivity and age and have accumulated 
much evidence on the association between age and wages in recent 
periods. Less is known of the nature of this relation in past times. It is, 
therefore, of some interest to consider in more detail the implications 
of the estimated relationships between age and length of indenture 
for the age-earnings profiles of servants.”* Table 3 presents estimates 
of the relative annual net earnings of servants by age for unskilled 
and skilled men. The calculations are based on the assumption that 
the expected colonial sale price of each individual’s contract was equal 
to the constant marginal cost of delivering servants to the colonies.” 
The relationship between age and net earnings is assumed to have 
been linear, making average net productivity during the term equal to 
net productivity at the term’s midpoint. The estimates of average net 


20 Fogel and Engerman 1974, p. 77; see also Metzer 1975, pp. 136-37. Interestingly, 
evidence on the hourly earnings of North Carolina cotton mill employees in 1907 
indicates that average female earnings were above those of males through the age of 15, 
equal at 16, and below male earnings thereafter (Wright 1980, p. 6). 

21 None of the four coefficients is significantly different from any of the other three at 
the .10 level. On the value of skilled servants in the colonies, see, e.g., Martin (1761, vol. 
4, fol. 97, verso); Jeaffreson (1878, 1:186); and Galenson (1979a, pp. 314-19). 

22 The coefficient of laborer for the London sample in table 1 is significantly different 
from that of farmer at the .10 level for a one-tailed ¢-test, from those of metal- 
construction and clothing-textiles at .05, and from that of services at .025. 

23 It might be noted that the erratic behavior 6f the coefficients of higher ages in table 
1 for the Middlesex sample may have been due to the falsification of the ages of some 
servants. This may have resulted from the legal requirements under which the regis- 
trations were made; for discussion and evidence see Galenson (1979a, appendix to 
chap. 3). 

2 posible seasonal variation in delivery costs has been controlled for in the equation 
reported in table 2. 
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productivity are derived from the following formulation of the mean 
present value of the contracts of servants of age at the beginning of 
the term: 


n 


> NPY = wi, 
IONS (eee 


i=1 


where NP = expected mean annual net productivity during the term; 
n = mean length of term for servants in each entering age group; 
w = mean annual wage payments made to servants during the 
term; r = discount rate. 

To solve for the value of NP for each entering cohort, the mean 
present value of the contracts was set equal to £10, an estimate of the 
marginal cost of delivering servants to the colonies. The mean age of 
each group at the time of binding was taken as the recorded age plus 
one-half year to allow for the rounding of age upon registration. The 
mean length of term by age was derived from the coefficients of table 
2, while the mean annual wage payments were taken directly from the 
indenture contracts.”> The estimates were made with a discount rate 
of 10 percent. 

The estimates of table 3, which indicate that the net-earnings 
profile of skilled servants was steeper than that of the unskilled, are 
consistent with the normal positive relationship between the steepness 
of age-earnings profiles and training. The ages at which the servants 
considered here were bound, between 15 and 20, were prime ones for 
training in a wide variety of skilled crafts in preindustrial England, 
through either apprenticeship or less formal arrangements. It is, 
therefore, not surprising that the net productivity of those in skilled 
trades rose rapidly during this period of the life cycle. 

Skilled servants received a considerable premium: Even at age 15, a 


*> No wages were recorded for unskilled servants. The average wage payments 
received by the skilled servants in the London sample by age were as in the table below. 
The present value of freedom dues at the time of binding varied across colonies and 
over time, according to differences and changes in legislation and with changes in the 
value of colonial currencies and commodities. No explicit allowance has been made for 
the dues in the calculation because of the difficulty of estimating their typical value; 
inclusion of the effect of the lump-sum payment would lower the estimates of the net 
annual earnings of servants without changing their relative values by age. 














Age (Years) Mean Annual Wage (£) 
15 0 

16 40 

17 79 

18 1.45 

19 1.52 


20 3.02 
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TABLE 3 


ESTIMATED RELATIVE NET ANNUAL EARNINGS OF SERVANTS BY AGE 














UNSKILLED SKILLED 

Mean Net Mean Net 

Annual Relative Annual Relative 

Earnings Net Earnings Earnings Net Earnings 
Age (ie (Age 22.9 = 1.00) Age (£) (Age 22.9 = 1.00) 
19.1 2.01 .742 18.4 2.39 413 
19.7 220 .812 19.2 2.86 495 
20.4 2.36 871 20.1 3.32 574 
DAee 2.52 .930 21.0 4.08 .706 
22.0 2.60 .959 21.9 4.22 .730 
22.9 Sati 1.000 22.9 5.78 1.000 





Nore.—Calculated from table 2 and n. 24. See text for procedure. The unskilled profile is calculated from the 
basic age profile of table 2, that of the skilled from the basic age profile combined with the skilled (“trade”) 
coefficients. 


skilled servant received a term 21 percent shorter than his unskilled 
counterpart. The existence of a sizable premium at such an early age 
could have been due in part to differences in the relative average 
work experience of the skilled and unskilled. Thus, possession of a 
skilled trade at any age implied prior work experience. The age of 
entry into the labor force for those men registered without occupa- 
tions cannot be determined, but it is possible that the typical age of 
entry for the unskilled into employment was that at which English 
boys normally left home to live in service, roughly 15. If this were the 
case, work experience and acquired on-the-job training might have 
accounted for a significant portion of the premium for skilled work- 
ers. This would particularly be true for the younger servants, as the 
relative level of work experience of a skilled to an unskilled worker 
would be greatest at the lower ages observed here and would decline 
with age thereafter. That the ratio of skilled to unskilled net earnings 
increased with age would appear to be strong evidence of the pres- 
ence of formal training for those in the skilled group. 

An interesting feature of the relative age—net-earnings profile of 
the unskilled shown in table 3 is its close resemblance to those profiles 
obtained by Fogel and Engerman for unskilled male slaves in the 
southern United States during 1790-1860 (1972, charts 3 and 4). In 
view of the considerable differences among these samples with re- 
spect to such variables as location and crops cultivated, the similarity 
of the shapes of the profiles might suggest the importance of physio- 
logical factors, particularly the rate of physical maturation, in deter- 
mining the change, with age, in the productivity of unskilled workers 
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under conditions of plantation agriculture in the eighteenth and 
nineteenth centuries. 

The relatively small premium paid for the ability to sign in the later 
sample may have been due both to the abundance of literate servants 
and to high literacy rates in the colonies. The decline in the size of the 
reduction of the term due to the ability to sign between the dates of 
the two samples may have resulted in part from a considerable in- 
crease in literacy among the servants, as only 35 percent signed in the 
earlier sample compared with 67 percent in the later one.”® 

In both samples, with other characteristics constant, servants bound 
for the West Indies tended to receive shorter terms than those bound 
for mainland colonies. That servants who immigrated to the West 
Indies received shorter terms to compensate them for their choice is 
consistent with the fact that, while both working conditions for ser- 
vants and economic opportunities for freedmen were known to be 
poor in the islands after the introduction of large-scale sugar cultiva- 
tion in the second half of the seventeenth century—with its attendant 
slave gangs and consolidation of small farms into large estates—the 
mainland long continued to be considered a land of opportunity for 
poor immigrants, where freed servants could hope to own land and 
become prosperous members of society. A persistent theme of West 
Indian complaints appeared in a 1675 petition sent to the king of 
England by the Council and Assembly of Barbados: “In former tymes 
Wee were plentifully furnished with Christian [i.e., white] servants 
from England . . . but now Wee can gett few English, having noe Lands 
to give them at the end of their tyme, which formerly was their main 
allurement.” The higher mortality rates of the West Indian colonies 
decreased servants’ expected productivity and made them reluctant to 
go to the region. However, in conjunction with the high productivity 
of labor in sugar production, those rates acted to raise the demand for 
new flows of replacement immigrant labor. That servants bound for 
the West Indies received terms shorter than those bound for the 
mainland, in spite of the higher mortality rates in the islands, implies 
that the marginal productivity of labor was higher in the West Indies 
than in the mainland colonies.’ 

Both the lower estimated intercept for the later sample and the 
estimated negative time trend of the later sample indicate a secular 
decline in the length of indenture. The direction of change is consis- 


26 On the relationship between ability to sign and other aspects of literacy in this 
period, see Schofield (1968, pp. 311-25). The difference between the coefficients of 
literacy in the two samples is significant at .10 for a two-tailed t-test. On the servants’ 
ability to sign, see Galenson (19796). 

27 London, Public Record Office 1675. On relative mortality rates in the West Indian 
and mainland colonies, see Wells (1975, pp. 280-82). 
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tent with a number of long-term trends, including rising reservation 
wages of servants due to rising real wages in England between the 
mid-1680s and the middle of the eighteenth century, falling real 
shipping costs, and declining colonial mortality rates which could 
have produced a secular increase in the colonial demand for labor.28 
The negative estimated trend further suggests the presence of a 
secular increase in real wages in the colonies during the period 
spanned by these two samples. 

A consistent seasonal pattern in the length of indenture appears in 
the results of table 2, as servants bound for mainland destinations 
whose indentures were signed between June and November received 
sizable reductions in their terms relative to servants bound in winter 
and spring. Most of these servants were bound for Maryland or 
Virginia, and the observed pattern could have been due to the effect 
of the seasonality of tobacco production on the costs involved in 
supplying servant labor to the Chesapeake. The shipping patterns 
which resulted from the timing of the harvests dictated that the 
amount of backhaul space for servants was greatest in summer and 
early fall. Since servants were provided with food and lodging from 
the time they signed their contracts, the cost of delivering a servant to 
the colonies may have declined in peak shipping seasons because the 
more frequent departures of ships reduced the average waiting time 
in port between binding and sailing. The lower costs of the peak 
seasons could, therefore, have resulted in shorter indentures for 
servants bound in peak seasons than for those bound in slack shipping 
months.?? 

Another potential source of variation in the length of indenture was 
annual changes in the colonial demand for labor. The results pre- 
sented in table 2 indicate that the lagged annual average price of 
muscovado sugar in London had a significant and negative effect on 
the length of indentures of servants bound for the West Indies during 
1718-59; the estimated effect of a change in the price of sugar from 
its minimum to its maximum in the period, with other things equal, 


28 On English wages, see Gilboy (1934, pp. 219-25) and Phelps Brown and Hopkins 
(1956, pp. 302-13). On changes in colonial mortality rates, see, e.g., Menard (1977a, 
pp. 99-100). The organization and quality of information in the market for contracts 
may have improved during the period spanned by the two samples. It is suggestive that 
the coefficient of variation of the term of indenture among men of a given age was 
considerably lower in the later than the earlier sample for six of the eight age groups of 
minors above the age of 12. On the relation of wage dispersion to information, see 
Stigler (1962). Pian 5 i 

2° On the seasonality of tobacco production and shipping, see Bullock (1649, p. 46); 
Alsop (1666, p. 51); and Bruce (1907, vol. 1, pp. 622-24). On the costs of maintaining 
servants between binding and sailing, see Smith (1947, pp. 36-37, 59-65) and Scottish 
Record Office (GD 23/6/98, nos. 4, 14, 18). For factors relevant to seasonality in the 
colonial demand for labor, see Mullin (1972, p. 15) and Morgan (1975, p. 158). 
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was a reduction of about 5 months in the term of indenture. The sign 
of this effect would be the one predicted if, as appears to have been 
the case, high sugar prices normally resulted from high levels of 
demand for sugar rather than reductions in supply, for high sugar 
prices would then have tended to produce high demand for labor 
and, ceteris paribus, to shorten terms.” 


IV. Conclusions 


The price paid for human capital in the colonial American market 
for indentured servants varied systematically with respect to factors 
which influenced servants’ productivity, as economic theory predicts. 
All servants incurred debts of similar value in immigrating to the 
colonies and sold claims on their future labor, in the form of inden- 
tures, to repay these debts. Characteristics which raised the expected 
productivity of servants in the American colonies raised the market 
valuation of their human capital and, therefore, shortened the term 
for which the servant was bound. Thus, servants with skilled trades 
and those able to sign served shorter terms than the unskilled and 
illiterate of similar age and sex. Women were found to have received 
shorter terms than men until the age of 18, perhaps due to their 
earlier maturation. The results also indicated that servants bound for 
the West Indies received shorter terms in compensation for their 
undesirable destinations. Servants bound during peak shipping sea- 
sons were found to have received reduced terms, perhaps due to the 
shorter average waiting time prior to departure in those months when 
backhaul cargo space was most abundant. Finally, the length of inden- 
ture of servants bound for the West Indies was found to have varied 
inversely with the price of sugar, suggesting that increases in the 
colonial demand for labor shortened the term of servitude. 
Among the issues which need further investigation are the precise 
reasons for the observed shapes of the age-earnings profiles of ser- 
vants. Additional research, including the collection of evidence on 
wage rates in the colonies by age and skill, may serve to distinguish 


%° Similar analysis of the contracts of servants bound for the Chesapeake during 
1718-40 indicates that the lagged annual farm price of Maryland tobacco had no 
significant effect on the length of indenture; in a regression equation which included 
the variables of table 2, the estimated coefficient of an interaction term between the 
price of tobacco, lagged 1 year, and Chesapeake destination was insignificant. Due to a 
greater continuing reliance on production of a single staple in the West Indies in the 
eighteenth century, the price of sugar may serve as a better index of the West Indian 
demand for labor than does the price of tobacco for the Chesapeake. On diversification 
of agricultural production in the Chesapeake, see Clemens (1974, pp. 100-148) and 
Stiverson (1977, pp. 65-103). 
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and isolate the effects of such contributory factors as physical matu- 
ration and investment in human capital in producing the age- 
earnings relationships which lay behind the market valuation of in- 
dentured human capital. What the present research has indicated is 
that the application of economic analysis to quantitative evidence 
generated by the system of indentured servitude can provide infor- 
mation on the way in which the market once explicitly evaluated 
stocks of human capital and, in so doing, can yield new insights into 
the operation of labor markets in early America. 


Appendix 


For the Middlesex sample of 1683-84, a regression equation was estimated 
with the amount of the cash payment (in shillings, sterling) made to a servant 
as the dependent variable, with the same independent variables used in the 
analysis of the length of indenture. The sample used was that of all servants, 
minors and adults, whose contracts contained all the necessary information. 
The results are as shown in table Al. 

The hypothesis that all the coefficients are simultaneously equal to zero can 
be rejected at the .01 level, but the proportion of the variance explained is 
small, as might be expected in view of the rarity of the payments. Only one 
coefficient is significant at the .01 level: Possession of a skilled trade, which 
table 1 shows to have reduced the term of an indenture, significantly raised 
the expected cash payment made to a servant. 

As noted in the text, cash payments appear to have been recorded only for 
minors in the London sample of 1718-59. The form of the payments re- 
corded differs from that of the earlier sample: Whereas the cash payments to 
servants in the Middlesex contracts of 1683-84 appear to have been simple 
lump-sum payments made at the time of binding, those contracted for in 
London in the eighteenth-century sample were generally salaries to be paid 


ABU Real 


ESTIMATED REGRESSION COEFFICIENTS, MIDDLESEX SAMPLE, 1683-84 











Independent Estimated Standard 
Variable Coefficient Error 
Age (years) 011 .038 
Sex? —.485 456 
Literacy” rato) 363 
Trade® 1.466 399 
West Indies? .088 354 
Constant R323 ate, 
inves .026 
F AON 
n 760 





Source.—London. Greater London Record Office (Middlesex Section). 
4Male = 0, female = 1. 

>Marked = 0, signed = 1. 

*Laborer cr no occupation = 0; all other occupations = 1. 

‘West Indian destinations = 1; mainland = 0. 
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TABLE A2 


ESTIMATED REGRESSION COEFFICIENTS, LONDON SAMPLE, 1718-59 











Independent Estimated Standard 
Variable Coefficient Error 
Age (years .144 054 
oe : .306 329 
Literacy 428 nl G7, 
Trade 1.397 .190 
West Indies 1.291 165 
Constant —3.141 ae 
R2 .094 
F 42.20 
n 2,046 





Source.—London, Corporation of London Records Office. 
Nore.—All variables defined as in table Al. 


annually in local currency. A regression was estimated with the amount of the 
annual salary (in pounds local currency) as the dependent variable, with the 
same independent variables used above. (The dependent variable is the 
unadjusted value of the colonial currency. Although most of the colonies’ 
currencies were devalued relative to sterling, the differences in currency 
values across the principal American colonies in most of this period were 
small. See McCusker 1978.) The sample used was that of all minors (age less 
than 21) whose contracts contained all the necessary information. The results 
were as shown in table A2. 

The relationship 1s again statistically significant, and the proportion of the 
variance explained is again low. Age, literacy, possession of a skilled trade, and 
West Indian destinations are all significant at .01 and positively related to the 
servant's salary; in table 2 it is shown that all are significantly and negatively 
associated with the length of indenture. 

The results of the analysis of servants’ cash payments and salaries in both 
samples are consistent with the hypothesis that these were positively related to 
the servants’ expected earnings in the colonies. These results therefore rein- 
force the analysis presented of the length of indenture. 
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The Welfare Cost of Capital Income Taxation in 
a Growing Economy 





Christophe Chamley 


Yale University 


The welfare cost of capital income taxation is analyzed in a general 
equilibrium framework, where the private sector is represented by a 
competitive household endowed with perfect foresight and an in- 
finite life. The value of the welfare cost depends essentially on the 
elasticity of substitution between capital and labor in the production 
function. Numerical estimates are presented for different values of 
the parameters of the model. The welfare gain obtained by the 
abolition of the capital income tax is smaller when the private sector 
is not endowed with perfect foresight (it is reduced by about 40 
percent when expectations are myopic). The allocation efficiency 
cost of the corporate tax dwarfs the intertemporal welfare cost. 


A central issue in the current debate on tax reform is the efficiency 
cost of a tax on capital income. Such a tax introduces a wedge between 
the prices of consumption at different dates and distorts the inter- 
temporal allocation of resources. 

Previous studies on the efficiency cost of the capital income tax have 
relied on simplifying assumptions which rule out important conse- 
quences of capital taxation. In general, these studies are of two types. 
The first (Krzyzaniak 1967; Sato 1967; Feldstein 1974a, 19746; 
Friedlander and Vandendorpe 1978) analyzes the capital income tax 
in the context of a neoclassical growth model with endogenous capital 
stock and factor prices but does not allow an optimal response of 
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household to changes of taxes. Typically the levels of consumption 
and labor supply do not depend on future prices and are not consis- 
tent with utility maximization. This method leads to results about the 
long-run incidence of taxation on aggregate variables; the transition 
to the steady state can be examined by numerical simulations. The 
second type of study (Levhari and Sheshinski 1972; Feldstein 1978) 
allows for maximizing behavior but assumes that the factor prices are 
exogenous and independent of the capital accumulation. The welfare 
cost of the capital income tax is measured by an application of the 
Harberger-Hicks-Hotelling formula (Green and Sheshinski 1979) to 
the intertemporal framework. 

The welfare cost of the capital income tax is analyzed here in a 
stylized intertemporal general equilibrium model (described in the 
next section). Issues about intra- or intergenerational equity are ig- 
nored. Therefore, it is assumed that the private sector can be repre- 
sented by a household with an infinite life.' 

This household determines the level of consumption by the 
maximization of its intertemporal utility function. Future factor 
prices (wage and interest rates) depend on the accumulation of capital 
through a neoclassical technology and are known with perfect 
foresight. The household behaves competitively: Future (endoge- 
nous) prices are taken as given. 

The welfare cost of the capital income tax is analyzed in the second 
section. Following common practice, we consider the welfare cost 
induced by a tax with lump-sum redistribution. Initially, the economy 
is assumed to be on the balanced growth path where capital income is 
taxed at a fixed rate (and tax revenues are refunded). At time zero, 
the tax is abolished (together with the refunds). Thereafter, the 
economy moves on a dynamic path toward a new steady state. ‘The 
welfare cost of the capital income tax is equal to the welfare gain 
obtained by the abolition of the tax, namely, by the difference be- 
tween the level of utility on the new dynamic path (after the tax 
reform) and the level of utility on the initial balanced growth path 
(with the tax in effect).2 As usual, the welfare cost of the tax is 


1 It would be sufficient to assume that individuals’ utilities depend on their consump- 
tion in their own finite lifetime and on the welfare of their immediate descendants (for 
a development of this argument, see Barro [1974]), If individuals do not leave a 
bequest, the problem of intergenerational equity arises, and the concept of excess 
burden used here does not apply: Even in the case where no revenue has to be raised, 
the no-tax solution is not the first best; the level of capital has no optimal property 
(Diamond 1965). In order to redistribute income among generations, a benevolent 
social planner would impose a tax or a subsidy on capital income (and possibly on other 
goods). For an analysis of taxation in a general equilibrium model where individuals 
maximize a lifetime utility function with no bequest, see Hall (1969), Diamond (1970), 
Summers (1979), and Chamley (1980b). 

2 We could also consider the imposition of a tax on the initial balanced growth path 
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measured by a wealth equivalent and is of a second order with respect 
to the tax rate. A second approximation of this excess burden is given, 
which depends on the parameters of the utility and production func- 
tions and on the growth rate. An extension of the Levhari and 
Sheshinski (1972) result is obtained as a special case. Since the 
general excess-burden formula is exact only for infinitesimal values of 
the tax rate, the error of this second-order approximation is analyzed 
in a numerical example. 

The assumption of perfect foresight is relaxed in Section III. Be- 
cause the dynamic path after the abolition of the tax is no longer 
optimal, the welfare gain induced by the tax reform is smaller in this 
case. The case of myopic expectations is an important example of the 
more general class of expectations which are considered. In the fol- 
lowing sections we revert to the assumption of perfect foresight. 

The case where the tax rate on capital income is not identical for all 
sectors of production (an example is found in the corporation tax) is 
considered in Section IV. The intertemporal welfare cost of the tax is 
compared with the inefficiency cost due to the misallocation of capital 
between the different sectors of production. 

In Section V, the assumption of a fixed labor supply is relaxed. 
Since the capital income tax lowers the long-run wage rate, its excess 
burden depends on the (compensated) elasticity of the labor supply. 

This analysis of the capital income tax relies on a stylized model.? 
However, numerous numerical examples show that some of the re- 
sults obtained are fairly robust. In the conclusion, these results are 
summarized and related to other studies using more disaggregated 
models. 


I. The Model 


There is one good in the economy. This good can be consumed or 
used as capital in the production process. Total output per efficiency 
unit of labor, net of capital depreciation, is given by the neoclassical 
production function, y = f(k), where & is the level of the capital stock 
per efficiency unit of labor. 

The private sector is represented by a household, growing at the 
rate n, which takes prices as given and maximizes under its budget 
constraint the utility function, 


U =|) e Plenty (ce) dt, (1) 


with no taxation. However, the analysis of the abolition (instead of the imposition) of 
the tax is technically more simple and suits better the problems of tax reform. It has 
been verified that for small tax rates, the two methods give the same results. 

* The same method could have been used to analyze the excess burden of the labor 
income tax or the income tax (Chamley 1980a). 
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where the following notation is used: p = pure rate of time prefer- 
ence, # = rate of growth of labor augmenting technological change, 
and c, = consumption per unit of efficient labor. 

The function wu will be assumed to be of the form u(c) = c!~7.4 The 
labor supply per capita is fixed and normalized to one at time zero. 
The representative household is endowed with perfect foresight and 
behaves competitively, taking the endogenous future prices (wage 
and interest rates) as given. 

Because of the first-order condition in the maximization of the 
utility function, the dynamic path of the economy satisfies the 
equation® 


C 
Ca oe ep) (2) 
where 7; is the net rate of return available to the household, and p* = 


Duat Ome te 
The capital accumulation is defined by 


ke =f (ht) — (a + whe — C2, (3) 


where k, represents the initial capital stock. 

The dynamic behavior of the economy is defined by equations (2) 
and (3) and by the initial values ky and co at time zero. The initial value 
of the capital stock kp is given. There is a unique value of co, such that 
its associated dynamic path satisfies the budget constraint of the 
household (which is equivalent here to the transversality conditions). 
For this value of co, the dynamic path converges to the steady state 
defined by® 


De aa), (4) 
c* = f(hk*) — (n + w)R™ (5) 


(an asterisk will denote a steady-state value). 


4 This assumption is necessary for the existence of a competitive balanced growth 
path when the intertemporal welfare function is additive and the rate of labor aug- 
menting technological change yw is different from zero. (For an introduction to the 
literature on optimal growth, see Koopmans [1967].) In a discrete-time formulation one 
could also use a stationary utility function. The existence and stability of optimal 
balanced growth paths in this context is studied by Iwai (1972). 

5 The consumption levels at time ¢ and ¢ + At satisfy the relation, 


i (eet Ade yl, L pat 


u' (ec) 1 + rAt- 
Up to the first order, this expression is equivalent to 
u'(eMc,) + u"(e*c,)(cysAt + Acy)eM* 
u'(e“c;) 


A straightforward manipulation gives (1/e,)(Ac,/At) = (1/a0)(r — p — op). When At 


tends to 0, we obtain eq. (2). 
6 The second-order conditions are derived from the concavity of the utility and the 


production functions. For an exhaustive treatment, see Arrow and Kurz (1970). 





= 1+ (p —7)At. 
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The optimal path also defines a consumption function, giving the 
level of consumption per unit of labor as a function of the capital- 
labor ratio at each instant (see fig. 1), 


i= c(h) (6) 


In the same way, at a given instant, the level of utility is determined by 
the integral (1) on the optimal path I for an initial value of the capital 
stock which is equal to k: U = J(R). 

We now review a few properties of the optimal path which will be 
useful in the subsequent sections. 

The slope of the consumption function c'(k*) at the stationary point 
k* is obtained by taking the limit of the ratio between the relations (2) 
and (3) when & tends to k*; c'(k*) is equal to the positive root of the 
equation, 


x hey =O; (7) 


where \ = p* — n — wand y = —[c(k*)f"(k*)/o = (1/oe)(c(k*)(r* + 
5d)w*/{k*[f(k*) + 5k*]}); € is the elasticity of substitution between 
capital and labor in the gross production function when the capital 
labor ratio is equal to k*.’ Using a first-order approximation of the 
capital accumulation (3) around k*, the difference between & and its 
steady-state value k* decreases asymptotically at a constant rate a (toa 
first-order approximation):® 


(a Ra), (8) 


where a is the coefficient of adjustment of the economy toward the 
steady state and is equal to the difference c’(k*) — 4; —a is also equal to 
the negative root of equation (7). 

The same asymptotic rule applies to every endogenous variable z, in 
the economy which depends only on the capital-labor ratio (as, e.g., 
the gross factor prices):° 


(2 = ZS) tapes ie) (8a) 


where the coefficient a is the same as in equation (8). 


"The parameter y can also be expressed as a function of quantities which are easily 
measurable: 


eet) | = 


ae 
y= ae ~@+n+n)] 


Qa 


where a is the share of gross capital income in the production function and & is the 
depreciation rate of the capital stock. 

* This regressive rule may be a very good approximation even if the difference k — k* 
is large (see Chamley 1979). 

° If z = g(ke), to the first order: (z, — z*) = g'(k*)ky = —g'(k*)a(ky — k*) = —a(u —z*). 
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II. The Excess Burden of the Capital Income Tax 


Consider now a tax on capital income at the constant rate 6, with 
lump-sum redistribution of its revenues. The analysis of the model 
described in the previous section applies with a net interest rate r now 
given by r = (1 — @)f"(&). In particular, in the long run, the net rate of 
return is constant and still equal to p*. In the long run, the capital 
income tax increases the gross rate of return and lowers the capital 
stock and aggregate consumption. The levels of consumption and 
capital per unit of labor in the steady state with taxation, ¢ and k, 
respectively, are given by 


p* = (1 — 6)f'(k) (4a) 


and 
c=f()-mt wh. (5a) 


This steady state is represented by point 4 in figure 1. 

We assume that initially the economy is in the long-run equilibrium 
with taxation (fig. 1, point A). The deadweight loss of the tax is 
measured by the welfare gain induced by the suppression of the tax.'® 

At time zero, the capital income tax is abolished" (together with the 
lump-sum refund). The net rate of return suddenly increases, and 
consumption decreases immediately from ¢ to cy (the optimal value 
which depends only on the initial capital stock ky = k, as described in 
Sec. I). After time zero, because of the increased savings,” the capital 
stock increases. The economy moves toward the new steady state E on 
the segment BE of the path [ defined by the dynamic equations (2) 
and (3) (see fig. 1). 

The welfare gain of the tax reform AU is given by the difference 
between the utility on the path BE and the utility in the steady state 
A 213 


ey ee ©) 
AU = J(k) ay, (9) 


10 When the tax revenues are not equal to zero, this is equivalent to the gain obtained 
by a shift from an interest tax to lump-sum taxation. 

‘1 It is essential throughout this study that the tax changes are unanticipated; for an 
analysis of the effects of anticipated tax changes, see Hall (1971). 

12 The observed elasticity of gross savings 7, with respect to the net rate of return at 
time zero (under the assumption of perfect foresight about future interest rates), can be 
determined from the parameters of the model: n = ae{r*/[(r* + 5)(m + w + 6)(1 — @)]}, 
where a and 6 are defined in n. 7 anda is given in table 2. For the values used below, the 
Cobb-Douglas case (¢ = € = 1), n = .95. 

13 When the tax rate @ is small, the following experiment is symmetrical and gives the 
same result: Assume that the economy is in the steady state with no tax (point E). At 
time zero, the tax is instituted (with lump-sum refunds). The economy moves then on a 
path toward the point A. Call J; the utility level on this path and J, the utility level at the 
steady state E. The excess burden of the interest tax is equal to Jo — Ji. 
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Fic. 1.—The consumption function and the dynamic path 


A Taylor expansion of (9) around k* gives 


AU ~ J(k*) + JRE — B*) + ce (k — hp 


u(c*) u'(c*) de = 
Na ge s k-— k* 
PO rath LD argent ae 
TOE®) dc (k = k*)?, 





2 (0 anak? 


From (5a) dé/dk = f'(k*) — n — w = p* — n — pw. Furthermore, the 
marginal value of capital J’(k*) is equal to the marginal utility of 
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consumption u’(c*). Therefore, 


1 u"(c*) Pc 
AU ~ —|J]"(k*) — ————_ 
bray pt Fe. ay 
This welfare cost is of a second order with respect to the tax rate. It 
1s convenient to divide it by the marginal utility of consumption at the 


stationary point E, u’(c*), in order to obtain its wealth equivalent 
AM :"4 





ce 
2 
AM = Lé eT (11) 
with 
PD elemie ss 


the parameters a and y have been defined in the previous section. 
The relation (11) is to be interpreted as follows: The welfare cost of 
a Capital income tax at the (small) rate 6 is equivalent to a permanent 
reduction of consumption on the balanced growth path by a fraction 
L@?. The variable L depends on the technology, the utility function, 
and the growth rate. We now consider some of its properties. 
The relation (12) can be rewritten, 





L = 1,| BB (13) 
Y 
where 
Pel r* 2 
Lider al spear (14) 


From (14), and the definition of a and y (eq. [7]), it is straightfor- 
ward to derive the following properties: 


OL OL 


<4 Z ——_—_ < 
L <Lp; —— ae 0; ao 0; 
lim L = Lp; lim L = 0. 

E00 e>0 


The excess burden of the tax is an increasing function of the 
elasticity of substitution € between capital and labor in the production 
function. When e is equal to zero, the capital-labor ratio is fixed, and 
since the labor supply is fixed, there is no distortion. The welfare cost 
is nil. When € is infinite, the gross factor prices are fixed, L = Lp. This 


14 The relations (11) and (12) are derived in the Appendix, Sec. II. 
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TABLE 1 


NorMALIZED EXCESS BURDEN OF THE CAPITAL INCOME TAX 








€ 








o ve i 6 8 1.0 PZ 1.4 1.6 1.8 2.0 00 
Pleo le 5 ONG 1 OME: S3in/,OOmNC2o)l mmo. 2am OO 2mm LOO 
10 1.21 2.33 3.41 445 546 644 7.40 8.33 9.25 10.14 200.0 
Ws LN) BO Bey Zeey) a Gals 70s I She 9.58 133.33 
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Norte.—Given by eq. (12), in percentage of annual consumption. 


TABLE 2 


ANNUAL RATE OF CONVERGENCE TOWARD BALANCED GROWTH PATH 














o EZ 4 6 8 1.0 2 ge 1.6 1.8 2.0 
POMS S198 me 3/4. eee) eee. Ola 14. Oem 3) 2 2 Lee () 1 577, OL) 
OR 2 35/4 ae 1625 ee 13 de ete LOO 9.14 8.40 7.80 7.30 6.88 
OOS 3 SO N70 Oa 8.08 7.30 6.69 6.21 5.80 5.46 
2.0) 16:51) 11E40 Omer 7.80 6.88 6.21 5.68 5.26 ao 4.62 





Nore.—a is measured in percentage. 


case is analogous to the partial equilibrium situation with exogenous 
factor prices and provides an upper bound for the general formula.!® 

Since 1/o is an index of the intertemporal substitutability of con- 
sumption in the utility function, it is not surprising that the excess 
burden is a decreasing function of o. 

Tables 1 and 2 present estimates of the variable L and of the annual 
rate of convergence of the economy toward the steady state a for 
different values of € and a.’* The other parameters of the model are 
chosen to characterize the U.S. economy:!” w = .02,n = .0, the gross 
capital income share is equal to .33, the rate of capital depreciation is 
equal to 5 percent, and p* is equal to the long-run value of the net rate 
of return and is taken to be equal to 4 percent. There remain only two 


* This formula has been derived by Levhari and Sheshinski (1972) for a stationary 
economy. Another proof, using the well-known Harberger-Hicks-Hotelling formula, is 
given in the Appendix, Sec. I. 

'6 These values of the rate of convergence can be compared with those discussed by 
Sato (1966) and give additional information for a choice of € and co. 

7 See, e.g., Jorgenson and Christensen 1973. 
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unknown parameters, p and a. Since there is no general agreement 
about their values,'® we present estimates for different values of o. 
Once a value is chosen for o, the pure rate of time preference is 
implicitly determined by the relation, p = p* — op. 

As can be seen from table 1, for realistic values of € the excess 
burden is not very sensitive to the elasticity of the marginal utility o.! 

The last column in table 1 corresponds to the partial equilibrium 
case. It is clear that for realistic values of €, the excess burden is much 
smaller. The variable L is represented as a function of e for a fixed 
value of a, equal to one, on figure 2 (curve B = 0). For the relevant 
range of €, the excess burden of the interest tax increases almost 
linearly with e. 

The value of the normalized excess burden L depends also on the 
discount rate r* and on the growth rate n + w. No general rule can be 
derived about the effects of these parameters. When the elasticity € is 
large, L increases with n + w and decreases with r* (as it is clear from 
the limit case € = ®”, described by eq. [14]). However, when the 
elasticity of substitution is smaller than two, and the other parameters 
of the model have the same values as above, it is found that L is an 
increasing function of both the discount rate and the growth rate.?° 


The Excess Burden for Large Tax Rates 


The measure of the excess burden given by expressions (11) and (12) 
is exact only for infinitesimal values of the tax rates. In order to 
estimate the bias involved for large values of 6, we have to use a 
different method. 

At time zero, the economy is on its balanced growth path under 
taxation (point 4 on fig. 1), and the initial values of the capital stock 
and the consumption level are given by (4a) and (5a). The dynamic 
path after the tax cut, characterized by (2) and (3), is transformed into 


18 Available estimates of o seem to point to a value higher than one. Weber (1975) 
reports values between 1.3 and 1.8. Wright's (1969) estimates are somewhat higher— 


around 4. 
19 In the limit case where the discount rate r* and the growth rate are equal, the 


variable L becomes 


Cs 





l Ma = a eee ae 
amor (ke) Se 2a)? paneer ; 


which is independent of the utility function and linear in the elasticity € (ceteris 
paribus). ‘ 

20 When e€ = o = 1, the following values are obtained: Take n + w = 2 percent; when 
r* increases from 2 percent to 6 percent, L increases from 3.00 percent to 6.96 percent. 
When r* is equal to 4 percent and n + ym increases from 2 percent to 4 percent, L 
increases from 5.46 percent (table 1) to 7.26 percent. 
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(Gin 9) 





0 <P ibs (65 2.0 


Fic. 2.—The normalized excess burden; € = elasticity of substitution between capital 
and labor, 8 = compensated elasticity of substitution between leisure and consumption. 


a discrete formulation and simulated numerically using a gradient 
algorithm of optimal control. The consumption equivalent of the 
utility after the tax cut c, is defined by the stationary level providing 
the same utility as the dynamic path after the tax cut: u(c,)/(p* — n — 
uw) = J(k). Table 3 reports the ratio [(c, — €)/é](1/62) as a function of 6 
for o = € = 1; the other parameters of the model have the same values 
as in the first two tables. 

The first number in the table (for 6 = 0) is obtained by formula (12). 
When @ is small, it gives a good approximation of the normalized 
excess burden. However, we can see that for large values of the tax 
rate 6, the approximation formula (12) underestimates the welfare 
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TABLE 3 


NORMALIZED EXCESS BURDEN FOR LARGE TAX RATES (%) 














9 
0 05 1 2 3 a 5 
LURES) wiete 5.82 6.25 7.27 8.58 10.33 12.77 


c e? 


Note.—o = € = l. 





gain of the tax cut (for 6 = 50 percent, by a factor slightly smaller than 
aes 

In order to illustrate the previous discussion, let us consider two 
numerical examples of a shift from capital income taxation to lump- 
sum taxation. 

Assume that the aggregate consumption in the steady state under 
taxation is normalized to one. When the tax rate @ is equal to 50 
percent, total revenues are equal to 12.34 percent. A shift to a lump- 
sum tax increases the level of welfare by a consumption equivalent of 
3.19 percent and the level of consumption in the long run by 8.38 
percent. The welfare cost of the capital income tax is equal to 26 
percent of the amount of tax revenues. The marginal welfare cost is 
obtained by multiplying the average cost by a factor of two (this factor 
should be somewhat greater than two for large rates, since the nor- 
malized excess burden increases with the tax rate). 

When @ is equal to 30 percent, tax revenues are equal to 6.73 
percent. A shift to lump-sum taxation would increase consumption in 
the long run by 3.3 percent. The welfare cost of the tax is equal to 
0.77 percent of the consumption level or 11 percent of the revenues. 

The method followed in this section takes into account both the 
short-run and the long-run incidence of the suppression of the capital 
income tax. We can observe that the welfare gain of tax reform 
(measured in consumption equivalent) is much smaller than the per- 
centage increase of consumption in the long run. Also, the excess 
burden of the capital income tax, although small in terms of aggre- 
gate consumption, is not negligible in terms of the revenues 
generated. 


Ill. The Case of the Nonperfect Foresight 


In the previous section, the welfare. gain of a reform from capital 
income taxation to lump-sum taxation was determined under the 


21 On the other side, when @ is large, the model considered here implies a sudden 
increase of saving which could be too large to be borne by a real economy. Therefore, it 
is likely that the numbers in table 3 may be overestimated. 
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assumption that the economy follows an optimal path toward the new 
steady state. However, when there is no complete set of future mar- 
kets to convey information on future prices, individuals may not have 
perfect foresight. In this case, the dynamic path after the tax reform is 
no longer optimal, and the welfare gain obtained is smaller.” 

The assumption of perfect foresight is relaxed in this section. For 
its saving decision, the competitive household relies now on point 
expectations about the future wage and interest rates. By assumption, 
these expectations satisfy the following property: At each instant ¢, 
anticipated future prices depend only on the value of the capital stock 
at time ¢ (e.g., through the factor prices at time f) and on some 
constant parameters (as the long-run values of the factor prices).”* It 
follows that the consumption level can be expressed as a function of k, 
c,(R). 

We also assume that expectations are consistent with the steady 
state and do not affect its stability. The rate of convergence of the 
economy a, depends on the type of expectations considered. For 
example, in the case of myopic expectations (where the factor prices 
observed at a given time are expected to be the same in the future), 
the rate of convergence is greater than in the case of perfect foresight. 

As in the previous section the initial situation of the economy is the 
steady state with taxation. After the suppression of the tax at time 
zero, the economy converges to the steady state E on the path T’'(B’E), 
which in general is different from the path [. The welfare gain is 
measured by the difference, 


fy u(c) 
AU = J(k) — ean (15) 
where ](k) represents the integral (1) on the path B’E. 

The second-order approximation of the utility level after the tax 
cut, J (k), depends only on the slope of the dynamic path at the steady 
state,” c;(k*), or on the rate of adjustment toward the steady state a, 
(since a; = c7[k*] — [r* — n — p)): 


J (h) = F(a, hy; (16) 
this utility is smaller than the utility on the optimal path, 
F(a;, k) < F(a, k) = J(k). (17) 


» For the class of expectations considered below, the dynamic path still converges to 
the steady state. 

*8 This class includes perfect foresight and myopic, stationary, and regressive expec- 
tations. This section uses some intuitive results derived in Chamley (1979). 

4 A proof is given in the Appendix, Sec. III. 
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The welfare gain induced by the tax cut is equivalent to a perma- 
nent increase of the consumption level by a fraction L,6?, where L, is 
defined by 


Ly LQ (a))/Q(a)] Ls (18) 
with 
= ay ay - 4 Z 
Qa) NEP 2a al ek). OXY Uy 


L is equal to the perfect-foresight value defined by (12) in the previous 
section. 

Using the relations (18) and (19) and the properties of a and c'(k*) 
(described in Sec. I), it is straightforward to show that the ratio L,/L is 
always smaller than one. When the private sector does not anticipate 
future prices with perfect foresight, the welfare gain induced by the 
suppression of the capital income tax is reduced. 

The ratio between the values of the excess burden with nonoptimal 
and optimal adjustments L,/L is represented in figure 3 as a function 
of the ratio between the rates of convergence (a,;/a). The different 
curves correspond to different values of the discount rate r*. The 
growth rate n + yu is equal to 2 percent, and the product ge is equal to 
i. 

Also the cases r* = 3 percent and r* = 6 percent provide a very 
good approximation (less than 1 percent error) of the cases r* = 4 
percent, ge = 4, and r* = 4 percent, ce = 2.5, respectively. 

It is interesting to observe that, in general, a shift from capital 
income taxation to lump-sum taxation always induces a welfare gain 
(L; is positive unless a; has an unrealistically large value, greater than 
[12.1] - a). This gain is close to its maximum for a wide range of values 
of the ratio (a,/a). When this ratio is between 4 and 3, L; is equal to at 
least 86 percent of L (for r* = 4 percent, ce = 1). 

As an example, let us consider the case of myopic expectations. The 
consumption function takes then the following form: 


es(k) = (1+ 22) Lf) — (n+ we 


Using (18) and (19), after elementary manipulations, the ratio be- 
tween the normalized excess burden under myopic expectations and 
its value under perfect foresight, L/L, is given by 


eek KV \? + 4y 
Id = 5 (lt ers (20) 
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with \ = r* — n — w, and y is the parameter described in equation (7). 
A good approximation of (20) is given by 
1 » 
LoL = 5:1 + nec (21) 
where a is the optimal rate of convergence reported in table 2. 

Under myopic expectations, the welfare gain of the tax reform is 
always equal to at least 50 percent of the perfect-foresight value. This 
case is represented by the point M on figure 3 (for e = o = 1, the rate 
of convergence toward the steady state under myopic expectations as 
is about six times larger than the value under perfect foresight a). The 
ratio L/L depends on the long-run value of the rate of return r*, the 
growth rate n + w, and the parameters e and o. Whenr* = 0.04 and n 
+ pw = 0.02, itcan be determined for different values of € and, using 
table 2 and the relation (21). 

We see in figure 3 that, for the relevant range of values of the 
various parameters, the ratio between the welfare gains of tax reform 
under myopic expectations and under perfect foresight is between .55 
and .65 and is not very sensitive to the parameters of the model.” 

Figure 3 also allows us to consider a more general class of expecta- 
tions. For example, when the private sector underestimates the future 
rate of convergence of the economy, the realized rate of convergence 
a, is in the interval (a, as). For this special class of expectations, the 
welfare gain of a tax cut is bounded below by the value found in the 
myopic case. 

Finally, it may be interesting to observe that when the private sector 
applies a modest amount of rationality, the dynamic path of the 
economy is close to the optimal path. Around the steady state, the 
variation through time of the factor prices is given by the expression, 
ere ee ie, a) 

Assume now that the long-run values of the factor prices are known 
and that the only uncertainty left is about their rate of convergence to 
the new steady state. Furthermore, individual expectations are of the 
regressive form; the expected values of a parameter x, at some future 
date t, xf, are determined by the following rule: 


(x? — x*) = —alke? —' x*), (22) 
Xo = Xo. 


Individuals revise at each instant the estimated value a of the future 
rate of convergence, using the observed value 4 (given by @ = 2/(z* — 


25 When the difference \ between the rate of return r* and the growth rate n + w 
tends to zero, the ratio L,/L tends to 1/2. Also, at the same time, the error implied by 
myopic expectations, which is measured by the ratio a,/a, tends to infinity. 
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z), where z is an arbitrarily chosen endogenous variable) and an 
adaptive rule, for example, @ = v(@ — a), where v > 0. 

Under these assumptions, and for an initial value of the capital 
stock not too different from the steady-state value, both the estimated 
value a and the actual value 4 of the rate of convergence tend to the 
optimal value a.2* The dynamic path is tangent to the perfect foresight 
path at the steady-state point. The value of the welfare gain of tax 
reform is close to its (maximum) perfect-foresight value. 


IV. The Welfare Cost of the Corporation Tax 


Most studies on the corporation tax assume that the corporate and the 
noncorporate sectors produce two different goods and that the ratio 
between their respective outputs depends on their relative prices.?’ 
For the sake of simplicity, we assume here that both sectors produce 
the same good and that the aggregate production function can be 
written in the form,”* y = g(k,, k2), where all quantities are divided by 
the total effective labor supply, and k, and k, represent the corporate 
and the noncorporate capital stock, respectively. The function g(hy, 2) 
is assumed to be homogeneous in its arguments. 

Initially the economy is in the steady state where only the corporate 
capital income is taxed at the rate 6. In each sector, the net rate of 
return is equal to the optimal stationary value p*, and the values of the 
capital stocks, k, and k,, are determined by 


_ Of 57 ey Ofer ey 
*x* — = = 
p* = (1 ~ 6) 58 (has a) = SE as fa). (23) 


The elimination of the corporate tax at time zero has two effects. 
First, capital is reallocated between the two sectors to equalize their 


*° Numerical simulations have shown that for almost any initial value of the capital 
stock, the convergence to the perfect-foresight path occurs after only a few periods. 
This procedure may be used as an algorithm to determine the perfect-foresight solu- 
tion. 

27 See Harberger (1962, 1976), Shoven and Whalley (1972), Boadway and Tred- 
denick (1975), and Shoven (1976) for a static analysis. Friedlander and Vandendorpe 
(1978) extend Harberger’s study to a dynamic framework. However, they do not 
address the problem of the incidence on the intertemporal welfare and do not consider 
a saving function derived from the optimization of an intertemporal utility function. 
es Shoven, and Whalley (1979) are currently working on a more elaborate 
model. 

*8 The usual assumption that the corporate and the noncorporate sectors produce 
two separate goods may also be a crude description of reality; see, e.g., Ebrill and 
Hartman 1977. 
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rates of return. The aggregate production function now takes the 
form, 


y = fh), (24) 


where 


f (hk) = max g (hy, ke). 
ky +ko=k 


After time zero, the private sector is endowed with perfect 
foresight, and the economy moves on the dynamic path studied in 
Section I, which is characterized by the equations, ¢ = (c/a)[f'(k) — 

p*],k =f (k) — (n + p)k —c, and ky =k =k, + ky. As in the relation (9), 
the welfare gain of the tax cut is equal to AU = J(k) — [w(é)/(r* — n — 
#)]. This difference can be decomposed into 


=|J(k) - =e ‘e Oe) (25) 


Te LL 


where ¢ = f(k) — (n + p)k is the stationary consumption available 
when the level of capital & is allocated efficiently between the corpo- 
rate and the noncorporate sectors. 

The first term of the RHS (right-hand side) of (25) has been 
analyzed in Section II and measures the intertemporal welfare cost of 
the corporate tax, originating in the reduction of capital accumula- 
tion. It is equivalent to a permanent reduction of the consumption 
level by a fraction L;,6*, where Lc, is defined by?® 


peepee (BL) ere (26) 


Since the tax revenues are equal to 6r*k,, this relation implies that for 
small values of the tax rates, the intertemporal welfare cost of capital 
income taxation depends only on the total amount of tax revenues 
(e.g., a 10 percent uniform tax on the whole capital stock and a 20 
percent tax on half the capital stock have up to the second order the 
same intertemporal welfare cost).*° 

Although the distribution of the tax burden across the different 
sectors does not affect the intertemporal welfare cost, it creates a 
distortion in the allocation of the capital stock between sectors. This 


29 A proof is given in the Appendix, Sec. IV. 

30 The extension to many sectors is straightforward. In particular, this analysis could 
be applied to a model with four types of capital: corporate and noncorporate capital, 
housing, and human capital. 
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production cost is measured by the second term in the RHS of (25) 
and is equal to! 


eu) ue) 5 ee ©) petty es 
(*—n—p) (r#¥—n- 4p) Lf() (hi, )] (27) 


The welfare cost induced by the production inefficiency is equivalent 
to a permanent reduction of the consumption level by a fraction L726", 
where 


Leo? = [f(k) = gh, ky) \/e. (28) 


This is precisely the excess burden studied by Harberger and others. 

The total welfare cost of the corporate tax is equal to the sum of the 
intertemporal cost and of the cost due to production inefficiency. It is 
equivalent to a permanent reduction of the consumption level by a 
fraction L,6?, where 


2 i [ f(A) iv Eller, he) | (29) 
Cc 


Lae Ue le = L Ge 


k,, ky, and k are given by (21), and 
6 f(s aC eee) (30) 


In order to have an order of magnitude of the respective quantities, 
assume that the ratio k,/k is equal to 3, which is close to the value 
observed in the U.S. economy, and that only the capital income 
originating in the corporate sector is taxed at the rate of 50 percent. 
Using the results of Section II, the intertemporal welfare cost is about 
0.25 percent of the level of aggregate consumption (or around 7.8 
percent of the tax revenues).** The value is somewhat lower than the 
cost of intersectoral misallocation, which in the same model is equal 
to 1.1 percent of consumption (this value is well within the range of 
current estimates of 0.5-1.5 percent; see Shoven 1976). The interin- 
dustry misallocation caused by the corporate tax seems to be much 
greater than the intertemporal distortion. 


V. The Case of an Elastic Labor Supply 


The capital income tax increases the gross rate of return and, by the 
factor price frontier, lowers the wage rate. When the labor supply 1s 


31 To a second order, it is equivalent to evaluate this difference at k or k*. 

2 ‘These estimates are obtained with a simplified version of the model in Section II: 
There is no depreciation, and the production function takes the form y = kk (with a 50 
percent corporate tax rate, k,/k, = 0.5). The capital income share, net of depreciation, 
2B, is the same as for the previous numerical results and is equal to 0.18. We have also 
used table 3 in the correction for large tax rates. 
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not fixed, this tax creates, in addition to the intertemporal distortion, 
a distortion in the choice between consumption and leisure at a given 
instant of time. 

For example, assume that the elasticity of substitution between 
capital and labor in the production function is equal to zero; the 
capital-labor ratio is constant. When the labor supply is fixed, the tax 
has no incidence on the capital stock and is equivalent to a lump-sum 
tax with no excess burden. However, in the long run the gross rate of 
return is higher and, by the price possibility frontier, the wage rate is 
lower. When the labor supply depends on the net factor prices, the 
incidence of the tax is to decrease the long-run labor supply, the 
capital stock, and the consumption level.** 

In general, it can be expected that the excess burden of the capital 
income tax is increased when the labor supply is variable. In this 
section, the framework developed in Section II is extended in order to 
measure this additional effect. 

The utility function of the private sector depends on the amounts 
of consumption and leisure: 


U -{ e ey (c,e"", L,)dl, (31) 
0 


where c,e"' is the consumption per capita and /; is the labor supply per 
capita (measured in natural units, and not in effective units). 

For the sake of simplicity, we consider the following form for the 
utility function u:*4 w(c,l) = (1 — B) loge + B log (T — 1), where B and 
T are exogenous parameters. The dynamic path of the economy is 
now characterized by the relations, 


6 —cti7{l—0)< p*}, (32) 
hy = f(hi,l:) — (n +m) — Ct, (33) 


and 


poe oe PC a = 
d-pir-h ne 
where r; and w;, are the gross interest and wage rates, respectively, and 
k, represents now the aggregate capital stock divided by the efficiency 
index e**. 
Equations (32) and (33) are the same as in the fixed labor supply 
case, and, at each time ft, the marginal rate of substitution between the 


33 A sufficient condition for these properties to be verified is the homotheticity of the 
utility function. This homotheticity is a necessary condition for the existence of an 
optimal balanced growth path. S Pee: 

34 When uy is different from 0, and the elasticity of substitution in u between con- 
sumption and leisure is different from 1, there is no optimal balanced growth path. 
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consumption of leisure and of produced goods is equal to the wage 
rate (eq. [34]). 

At a given instant, the labor supply depends not only on the wage 
rate at the same moment but also on the future factor prices (wage 
and interest rates). 

The initial state of the economy (where the tax has been in effect 
for an infinite amount of time) is defined by the stationary equivalents 
O132)= (34): 





a pa 
7 = Tau (35) 
é=A()l, (36) 
and 
Se utr 37 
(aT) w(r), (37) 


where A (r) is defined using the production function, A (r) = r[(k/l)(r)] 
+ w(r) — n — p, and the capital-labor ratiok// is expressed as a function 
of the gross rate of return r. 

At time zero, the tax is suppressed. Thereafter, the dynamic be- 
havior of the economy is described by (32), (33), and (34), where 6 is 
replaced by 0; the new steady state is characterized by 


po (38) 
c* = A(r*)I*, (39) 


and 


Bc* i 
aq-sqr-m oe 
The increase of welfare induced by the tax cut can be approximated 
to the second order by the same method as in Section II; its wealth 
equivalent is of the form, AM = Lg [c*/(r* — n — w)]6”, which depends 
on the parameter B. The formula for Lg is rather complicated,®° so we 
present in tables 4 and 5 some numerical estimates analogous to those 
in Section II. 

Also, Lg is represented as a function of € in figure 2, for the values B 
= 0, .6, 1; 8 measures the long-run compensated elasticity of the labor 
supply with respect to the wage rate. When it is equal to zero, the 
labor supply is fixed; this is the case studied in Section II. 

In table 4 and figure 2, we can see that the excess burden L 36”, for a 
given value of 6, is increasing with B and is bounded by the value of 


35 A computer program for its numerical evaluation is available. 
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L,6?, obtained for B = 1.°° It is remarkable that the difference Lg — Lo 
does not appear to be significantly affected by the value of e*” (for € 
smaller than 2) and that this difference is relatively small with respect 
to Ly when the production function is of the Cobb-Douglas type (€ = 
1) 

ee e is infinite, we have the partial equilibrium case again. It is 
straightforward to show that the relation (14) is still valid. 

The welfare cost of the capital income tax at the rate @ is measured 
by a fraction of the full consumption of produced goods and of 
leisure (valued at the wage rate), which is equal to Lp@?, where Ly = 
rt/(r* — n — p)P. 


VI. Conclusion 


The general equilibrium models used in this study are highly stylized. 
However, the numerical examples considered indicate that the 
character of its results is fairly general. 

The value of the excess burden of the capital income tax depends 
mainly on the elasticity of substitution between capital and labor e. 
When é¢ is smaller than two, the excess burden is almost proportional 
to e. No general rule could be derived about the effects of the dis- 
count rate. When the other parameters of the model are chosen to 
characterize the U.S. economy, the excess burden of the capital in- 
come tax increases with the discount rate and the growth rate. 

Interestingly enough, when € is smaller than two, an increase of the 
intertemporal substitutability of consumption (measured here by the 
inverse of the marginal utility of consumption) increases the value of 
the excess burden only by a negligible amount. In this case, although 
the deadweight loss of the tax is relatively small with respect to the 
level of aggregate consumption, or to its value in the case of fixed 
factor prices (€ = ©), it is not negligible with respect to the amount of tax 
revenues: A capital income tax of 50 percent implies a welfare loss 
equivalent to 26 percent of the tax revenues; when the tax rate is 
equal to 30 percent, the deadweight loss is equal to 11 percent of the 
tax revenues. These values should be multiplied by a number between 
1 and 4/3 if the labor supply is elastic. 

These numbers are indicative of the welfare gain induced by a shift 
from capital income taxation to lump-sum taxation, when individuals 


°° ‘The compensated elasticity of the labor supply B is bounded by 1. 

*7 Numerical experiments have shown that the difference L, — Ly is not sensitive to 
variations of 8, p, or yw, for realistic values of e«. When € is equal to zero, the excess 
burden of the capital income tax arises because of its incidence on the wage rate. In this 
special case, the interest tax has the same excess burden as the wage tax which generates 
an equal amount of revenues (Chamley 1980a). 
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have perfect foresight about the behavior of the economy after the tax 
is suppressed. Since the determination of the perfect-foresight path is 
not an easy task for most economists, even in a simple model, this 
assumption may not be realistic. When the tax reform does not lead to 
an optimal path, its benefits are reduced. For example, under myopic 
expectations, the welfare gain induced by the abolition of the capital 
income tax is only equal to about 60 percent of its value under perfect 
foresight. However, myopic expectations grossly underestimate the 
future behavior of the economy. When individuals anticipate some- 
what this future behavior, the welfare gain of tax reform is close to its 
maximum value, even if the degree of foresight is not very accurate.** 
(A method to approximate the perfect foresight path has also been 
suggested in Sec. III.) 

Finally, under the present U.S. tax system, the rates of the capital 
income tax vary by a large amount from one sector of production to 
another. When the overall elasticity of substitution between capital 
and labor is not too large (less than two), the welfare gains obtained by 
an equalization of these rates dwarf the gains obtained by a reduction 
of the global tax on capital income.*® 


Appendix 


I. The Excess Burden in Partial Equilibrium 


An income equivalent of the excess burden is given by the well-known 
Harberger-Hicks-Hotelling formula. In a continuous-time formulation, this 
becomes 





1 + io Oc; Uv t ’ 
iva ApApe| ) e™dtdt', (Al) 
2) Jo 0 


pe 
with the following notation: 
m=n+ pw 
pr = e779 (price of c; at time 0), 


Ap, = r*6te7"™* 


U 
Pe ) , compensated derivative of c; with respect to py. 


Ope 


38 The assumption of myopic expectations*is used in Hudson and Jorgenson (1976) 
and in Fullerton et al. (1979). Both studies rely on multisectoral models where one type 
of capital is allocated between the different sectors of production. Because there is only 
one state variable (the aggregate capital stock), regressive expectations are still 
asymptotically correct, and the discussion in Sec. II applies; in particular, the mecha- 
nism suggested there could be used to improve the accuracy of foresight. 

39 This result is also found in Fullerton et al. (1979). 
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By application of Slutsky’s equation, 


ab t (f° (elon , 
= mn Ap, dt 
ae 250 ee 0 Ope Pe dt 


rhe mt c ({M Oey (f" mt! | 
ci ale é Ap (— 4 " é cu A py dt dt. 


The first expression in parentheses is simply equal to the variation of the 
consumption at time ¢, Ac;, after an uncompensated interest tax has been applied 
(at time zero). 

Since the utility function is homogeneous, the income elasticity of con- 
sumption is equal to one, and the excess burden is given by 


2 
J omc palt) 5 
0 


a1” om l | 
AM=5 és é ApAcdt + oM 


From (2) and the budget constraint, the uncompensated demand function, ¢;, 
is expressed as follows: c; = vMe%, withy = r* —n — w — Or*[1 — (1/o)J, anda 
= —(r*/o)@. Therefore, when @ is small, 


NGp = Mir (2 = 3] of eal 








oO 


We replace Ac; in the above expression for AM, and since the wealth M is 
equal to the present value of the consumption stream on the balanced growth 
path, we have 


c* 


AM = -Lp 62, (A2) 


Viet aT 


1 r* < 
1 al i hp ), 
For a given value of the interest tax rate, the welfare cost in partial equilib- 
rium is an increasing function of the intertemporal substitutability of con- 


sumption in the utility function, which is measured by I/c. It is a decreasing 
function of the discount rate and an increasing function of the growth rate. 


with 


P 


Il. The Excess Burden in General Equilibrium 


We determine here a second-order equivalent of the welfare gain AU, ob- 
tained by the suppression of the interest tax: AU = J(k) — [weE)A] (X = p* —n 
— p). The terms J(k) and u(c) are considered separately. 

The term J(k) is equal to 


Jib = [o> erat 


0 
where c; is taken on the dynamic path BE (fig. 1). This integral can be 
decomposed into two terms: 


oo 


J = u(c)At | e u(c,dt, 


At 
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where A‘ is a small interval of time. By taking an infinitesimal value for At, we 
verify that the function J satisfies the relation, 0 = u(c) — AJ + J'(k)k; or, se 
the capital accumulation equation (eq. [3]), 0 = ulc(k)] — AJR) + JA) (A) — 
+ p)k — c(k)). 

Differentiating this expression twice at the point k*, we obtain 

u'c 12 + f'u ! 
Reta 
where u’ = (du/dc)(c*), u" = (d?uldc?)(c*), c' = (de/dk)(k*), c"” = [d2c(k*)Vdk?, and 
Gi— 6 (hk) a Ne _ 

Therefore, a second-order approximation of J at & is given by 
meee Ee, 
2 2a ye 
Using the properties of the optimal consumption function described in Sec- 
oon I of the text, we can rearrange the term of the second order: 


cl? spits ! > ul (2 c | 


JAX) = ul(c*), J"(R*) = 


t= 7 + u'(k — k*) + "cl? + flu'y(k — k*)?, (A3) 








on + 2a) 2(’ + 2a) Co 
Se OR ad) nt CG OG) 2th c! 
2( + 2a) 2(A + 2a) 2 
Hence, 
J@= =—+u '(k ko Xe hey. (A4) 


The value of consumption in the steady state under taxation, ¢, is determined 
by é¢ =f (k) — (n + w)k. The second-order equivalent of w(é)/A can be written as 





+ul(k —k*) + x (uf + w'r2)(k — RP, (A5) 


Ca , = *)\2 
aoa (A6) 
Taking the difference between J(k) and u(c)/d (in [A4] and [A6]), we have 





uw 
r 
uU < u" 
= — + u'(k — k*) + — 
wi =| 


u=-4£ &- Key, 
A 5 ( ) 


where a is the positive root of x? + Ax — y = 0 (described in text Sec. I). 
Since f' (k) = p*/(1 — 0) and (k — k*)f" ~ p*6, the welfare change can be 
rewritten as 


Au = HL & aN) Ae ee pte 
20 A\A Cha Troe pp 


melee 
iS on) see 
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(the partial equilibrium value of L expressed by the relation [14] in the text), 
and 


We —of "(k*) 


oO 


The term Aa/y is equal to the positive root of the equation (y/A”)x? +x — 1= 
(0. This root is contained in the interval (0,1). The derivation of its properties 
when e varies is straightforward. The sign of 0L/0q is obtained by the same 
method. 


III. The Case of Nonperfect Foresight 


When the private sector is not endowed with perfect foresight, c'(k*) is no 
longer the positive root of the equation x? — Ax — y = 0. The welfare gain of 
the interest tax cut is obtained by the same method as before. However, taking 
the difference between (A3) and (A5), we find 


bie if 1 1 o cr 
w(t rt Hb x 
fs | D a Nets 2a; r c NSE 2a; 

where c; and a, represent the consumption function and the coefficient of 
adjustment of the economy toward the balanced growth path, under nonper- 
fect foresight. 

Use the relation a; = c; — A and the definition of the perfect-foresight 
values for a; and cj, respectively a and c’, to obtain 

! ” k aes RG 
au = wens") (AS) Qa), 














where 





= ay ay ‘4 oa 

Qa) dX + 2a; | ac' r i 

The consumption equivalent of the welfare gain can then be expressed as L, 
= L - Q(a;)/Q(a), where L is the value of L; under perfect foresight. 


IV. The Corporation Tax 


We examine here the long-run incidence of the corporation tax on the capital 
stock. The capital stocks in the corporate and in the noncorporate sector, k, 
and k,, respectively, are determined in the long run by the relations p* = (1 — 
0)g1(k,, ko) = go(ky, ky). Differentiate this expression around 6 = 0 to obtain the 
variation of the aggregate capital stock, dk: 


dk = dk, + dky = (He EB re, (A7) 


with A = 911832 — 212221. Assume now that the tax is applied uniformly to both 
sectors, and call Dk the variation of the capital stock: 


Dk = ( gi + " = 28% Jove. (A8) 
Taking the ratio between (A7) and (A8), 


aka 822 — Lie 


Dk fi + S22 — 2g 
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Since the production function g(k,, k,) is homogeneous, it can be rewritten 
under the form g(k,, kz) = H[F (k,, k2)], where F is homogeneous of degree 
one. 

Use the properties of F, and the equality of F; and F} (at @ = 0), to write 


G22 — S12 = H' (Fe — Fp) = H'F ( ie 


1 


2 
gh + Ble — 2g", = H'F hy 1 eee) } 
ky ky 
Taking the ratio between these relations, dk/Dk = k,/k. This result could be 
generalized easily. The long-run incidence of a sectoral capital income tax on 
the total capital stock is proportional to the share of the taxed capital with 
respect to the total capital stock. 
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Some standard topics in the theory of international trade are recon- 
sidered in this paper by distinguishing between national and aggre- 
gate income when fixed supplies of foreign inputs are present within 
the home country. Under conditions that would ensure a national 
welfare gain if foreign ownership were absent, international trans- 
fer, economic growth, or tariff policy might cause a national welfare 
loss in the presence of foreign ownership. The techniques developed 
could be applied to other domestic distinctions (such as those based 
on race, sex, age, or ethnicity) and to the theory of customs unions in 
a three-country world. 


I. Introduction 


This paper reconsiders a number of standard topics in the theory 
of international trade by taking explicit account of the distinction 
between national and aggregate income when fixed supplies of 
foreign-owned inputs are present within the domestic economy. Ex- 
tending the work of Bhagwati and Brecher (1980),’ the following 
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partial financial support of the research underlying this paper. The comments and 
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are gratefully acknowledged. 
! Their work, in turn, extends the analysis of Bhagwati and Tironi (1980), who 


concentrate on a special case mentioned in n. 3 below. 


[Journal of Political Economy, 1981, vol. 89, no. 3] 
© 1981 by The University of Chicago. 0022-3808/81/8903-0004$01.50 


ao7 


498 JOURNAL OF POLITICAL ECONOMY 


analysis takes a new look at welfare-theoretic aspects of international 
transfer, economic expansion, and tariff policy, while it emphasizes 
significant departures from conventional wisdom that arise in the 
presence of foreign ownership. As these selected departures suggest, 
many standard results are open to serious question when part of the 
domestic product accrues to factor inputs from abroad. 

Originally, the motivation for the present two-group analysis (based 
on the national-foreign distinction) came from a recent concern in 
Latin America, where policymakers have been worried about the im- 
pact of trade liberalization on national welfare, given the domestic 
presence of foreign-owned multinational corporations. After further 
reflection, however, it is clear that the treatment below has much 
greater applicability to a broad range of analytically similar cases. For 
example, it is possible to treat in much the same way a wide variety of 
alternative domestic distinctions, including those based on race, sex, 
age, or ethnicity. The following techniques and results, moreover, are 
directly relevant for the fully analogous two-group issue relating to 
the distribution of gains (or losses) between trading partners in a 
customs union (such as the European Economic Community) with 
factor mobility. While these other policy problems are of considerable 
importance and interest as well, only the national-foreign distinction 
is pursued explicitly here for the sake of brevity. 

Section II reviews the basic model of an open economy, in which 
foreign-owned and national supplies of two homogeneous factors are 
combined to produce two commodities. As Section III then shows, a 
transfer-receiving country might suffer a loss in national welfare, 
even under the usual conditions which would ensure a welfare gain if 
foreign ownership were absent. As established next by Section IV, a 
country experiencing economic expansion (due to factor-endowment 
growth or technological advance) might encounter a deterioration in 
national welfare, even under well-known conditions which would 
preclude this possibility of “immiserizing growth” in the absence of 
foreign ownership. Afterward, Section V explains why free trade 
might be inferior to both no trade and subsidized trade, as far as 
national welfare is concerned.? Section VI summarizes the paper’s 
main results, based on the possibility of aggregate and national wel- 
fare moving in opposite directions. 

Needless to say, this possibility would not arise if foreign-owned 
factors were taxed to the nationally optimal extent. Indeed, with these 
factors in perfectly inelastic supply, the optimal tax on each foreign 


* This result is obtained also by Bhagwati and Tironi (1980), for a special case 
identified in n. 3 below. In addition, since Bhagwati and Brecher (1980) compare 
free-trade equilibrium with autarky, the present paper will emphasize instead the 
comparison of free versus subsidized trade. 
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input clearly would be 100 percent, thereby removing the after-tax 
distinction between aggregate and national welfare. Assuming that 
this type of optimal taxation of factors is politically infeasible, how- 
ever, the present analysis cautions nationally oriented policymakers 
against the usual, automatic adoption of the standard welfare conclu- 
sions which reflect an aggregate point of view. More specifically, this 
paper shows precisely how the traditional (aggregate) propositions 
must be modified for a truly national perspective, when political 
constraints eliminate optimal taxation of inputs from abroad. 


II. The Basic Model 


Following the analysis of Bhagwati and Brecher (1980), the present 
section summarizes the basic two-commodity, two-factor model of an 
open economy (large or small), which plays host to given quantities of 
inputs from abroad. The aggregate factor endowments of the country 
are fixed at K% units of capital and L“ units of labor, while the given 
amounts K” and L” are the national endowments of capital and la- 
bor, respectively. (Thus, the fixed supplies of foreign-owned capital 
and labor within the home country are K* — K" and L¢ — L?, respec- 
tively.) It is assumed that K* > K" > 0 and L«>L"> 0, excluding the 
possibility that either factor within the home economy is owned 
wholly by nationals or completely by foreigners.* Commodity two is 
always labor intensive relative to capital-intensive commodity one, 
and the well-behaved technology exhibits constant returns to scale. 

In figure 1, the home country is depicted in free-trade equilibrium. 
Aggregate production is at point Q? on production-possibility frontier 
TT“ (corresponding to K* and L*), aggregate income is represented 
by budget line Q*D, and aggregate consumption occurs at point D* 
on indifference curve /$/%. (For simplicity of exposition, it is assumed 
that all income earned by factors from abroad is consumed locally, to 
avoid having to show repatriation of such income within the diagram.) 
By the reasoning of Bhagwati and Brecher (1980), national consump- 
tion takes place at point D” on indifference curve Ty”, with national 
income given by budget line Q"D” (parallel to Q*D*), as if nationals 
produced separately at point Q” on production-possibility frontier 
T2T% (drawn for K" and L”).4 To emphasize that the main results of 
this paper qualitatively do not require any differences in consumer 
preferences between nationals and foreigners within the home coun- 
try, assume throughout the text ‘that the same set of indifference 


3 For the special case in which K¢*>K"= 0andL* = L" > 0, see Bhagwati and Tironi 
(1980). 

4 The discussion could be extended readily to allow for the possibility of complete 
specialization, following the analysis of Bhagwati and Brecher (1980). 
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Fic. 1.—Differential trade-volume phenomenon 


curves with unitary income elasticities of demand represents both 
national and aggregate tastes in consumption, although this sim- 
plification of the exposition could be dropped (as in footnotes to this 
paper) without detracting from the essence of the analysis.® 

The model may be summarized conveniently as follows: 


Xi, = Fi(p),i = 1, 2,7 =a,n; (1) 
Yi = Xi + pX4, j =a,n; (2) 
W’ = U3(Ci, C4), j = a, n; (3) 
Cl + pCh = ¥2.7 =a. n: (4) 


where p denotes the relative price of the second commodity in terms 
of the first; X4,denotes output of commodity? on frontier 7374; each F i, 
is a conventional function of p, given K/, L’ and the (uniform) technol- 
ogy for commodity 7; ¥* and Y" denote the real value of aggregate and 
national income, respectively, in terms of the first commodity; C¢ and 
C? denote aggregate and national consumption, respectively, of com- 
modity 2 (@ = 1, 2); W* and W" denote aggregate and national welfare, 


° Nn. 6-8, 10, and 15 below extend the discussion to let tastes differ between 
nationals and foreigners within the home country. These extensions bring out the 
essentially “three-country” flavor of the analysis, in which nationals, domestically lo- 
cated foreigners, and the rest of the world can be treated as three distinct components 
of the international economy. 
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Fic. 2.—Differential trade-pattern phenomenon 


respectively; and each U’ is a concave function of Ci and C4, with 
positive partial derivatives denoted by U¥,= 0U//dC),(@ = 1, 2). 
Later in the paper when a change in relative prices is induced by 
various parametric shifts, either of the following phenomena might 
lead to a fall in national despite a rise in aggregate welfare, depending 
on the strength of other induced effects. The differential trade- 
volume phenomenon is shown in figure 1, where the aggregate (ac- 
tual) volume of trade (defined by line segment Q7D%) is less than the 
national (hypothetical) volume of trade (defined by line segment 
QD"), implying (ceteris paribus) that a terms-of-trade deterioration 
worsens national by more than aggregate welfare. Figure 2 (labeled 
similarly) illustrates the Bhagwati and Brecher (1980) differential 
trade-pattern phenomenon, which arises when the aggregate and 
national patterns of trade differ (in direction), so that an aggregate 
terms-of-trade improvement (tending to raise W%) means a national 
terms-of-trade deterioration (tending to lower W”). The national 
relative to the aggregate endowment of factors is labor abundant in 
figure 1 (with K¢/L¢ > K/L") but capital abundant in figure 2 (with 
K/L* < K/L"), as suggested by the relative shapes of frontiers T3T¢ 
and T27T?, in accordance with the reasoning of Rybczynski (1955). 
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To understand the possibility of a fall in national welfare despite a 
rise in aggregate welfare, it might be tempting to go no further than 
the following simple observation. Whenever the national and aggre- 
gate endowments exhibit different capital/labor ratios, the domestic 
distribution of income might deteriorate for nationals, as a change in 
relative commodity prices alters the wage/rental ratio for reasons 
expounded by Stolper and Samuelson (1941). It is important to rec- 
ognize, however, that generally this income-redistribution effect will 
not be strong enough to produce the differential responses in na- 
tional and aggregate welfare if the relative factor-endowment discrep- 
ancy is too small to create either the differential trade-volume or the 
differential trade-pattern phenomenon. Even when either of these 
phenomena arises, moreover, a fall in national welfare despite a rise 
in aggregate welfare can occur if and only if certain specific conditions 
(derived below) are satisfied. 


III. International Transfer 


According to a standard result in the literature (see Mundell 1960), a 
transfer-receiving country cannot suffer a loss in aggregate welfare 
despite any possible deterioration in the aggregate terms of trade, as 
long as international commodity-market equilibrium is stable. In 
other words, the transfer-induced change in W% cannot be negative, 
assuming that an excess demand for or supply of the second good in 
world markets can be cleared by a rise or fall in p, respectively. As 
the following argument demonstrates, however, a (large) transfer- 
receiving country might suffer a deterioration in national welfare, 
even under the assumption (maintained throughout the present 
paper) that commodity markets are stable. This demonstration of a 
transfer-induced fall in W”, moreover, does not even require a rise in 
the relative price of home importables. 

If it is assumed that the transfer is given only to nationals, equations 
(2) are modified as follows: 


Yi= Xi + pX$ + 7,7 =a,n, (5) 


where 7 is the real value of the transfer in terms of the first commod- 
ity. If any part of the transfer were given to foreigners within the 
home country, the chances for a decline in W” would simply be 
enhanced, thereby strengthening the argument below. 

To examine the welfare implications of the transfer, differentiate 
equations (1), (3), (4), and (5) totally with respect to 7—assuming 
(without loss of generality) that initially Ui = 1, while noting that 

i/U4, = p = —(dFi/dp)/(dF3/dp) from the first-order conditions for 
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maximizing utility and profit. In this way, it is a straightforward 
exercise to derive 


dWifdr = 1 + (E‘dpldt),j = a,n, (6) 


where E? = X4 — C4. Consistent with figures 1 and 2, which depict the 
home country exporting the second good, E* > 0 by assumption 
throughout the present paper. As illustrated above, however, E” can 
be either positive (in fig. 1) or negative (in fig. 2). 

As equations (6) confirm, dW“%/d7r is the familiar sum of the follow- 
ing two components: the primary gain (= 1) from the transfer- 
induced increase in aggregate income, at the initial (pretransfer) set 
of relative prices; plus the secondary effect (= E“dp/dz) from the 
possible increase or decrease in the real exchange value of the initial 
volume of home exports, in the event of a transfer-induced change (if 
any) in relative prices. The expression for dW"/d7 is analogous. If 
foreign inputs were entirely absent from the home country, the dis- 
tinction between national and aggregate variables would disappear, 
thereby implying that E” = E* and (hence) that dW"/dr = dW®/dr. 
Given the actual presence of factor inputs from abroad, however, 
dW"/dr generally differs from dW“/dr, except in the special case where 
either E” = E*% (despite the foreign presence) or dp/dr = 0. 

To determine precise conditions for the direction of change in 
welfare, consider the standard transfer-induced terms-of-trade re- 
sponse, analyzed previously by Samuelson (1952, 1954) and sub- 
sequently by Mundell (1960). Thus, by well-known reasoning, 


dplidr=il =m — m*)\ le + ee LE, (7) 


where e (> 0) and m denote the relative-price elasticity of import 
demand and the marginal propensity to consume the importable, 
respectively, for the home country; e* (> 0) and m* denote the 
corresponding variables for the rest of the world; and 7 = 0 in the 
initial (pretransfer) equilibrium.’ Given the assumption above that 
world commodity-market equilibrium is stable, e + e* > 1 throughout 
the present paper. 

If equation (7) is substituted into equations (6), simple manipulation 
confirms that 


dW4/dr = (c + e*)/(e + e* — 1) >0, (8) 


6 If preferences in consumption were allowed to differ between nationals and foreign- 
ers within the home country, it would be necessary to rewrite eq. (7) as follows, to reflect 
the present assumption that the entire transfer goes exclusively to nationals: 


dpldr = (1 — m™ — m*)/(e + e* — 1)E%, (7') 


where m” denotes the national marginal propensity to consume the home importable. 
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but shows that 
dWdr = 0 as (e + e* — 1)E* S (m + m* — IDE", (9) 


where e€ (> 0) and e* (> 0) denote the compensated (constant-utility) 
relative-price elasticity of import demand for the home country and 
the rest of the world, respectively, while e = € + m and e* = e* + m*, 
according to a standard decomposition.” Although dW%/d7 > 0 unam- 
biguously, it is evidently possible to have dW"/dr < 0 nevertheless.® 

To highlight the important role of the differential trade-pattern 
and differential trade-volume phenomena, it is helpful to revert to 
equations (6), which imply that a fall in national despite the rise in 
aggregate welfare can occur only if (E" — E*)dp/dr < 0. This necessary 
condition for a fall in W” holds if either dp/dr < 0 in presence of the 
differential trade-volume phenomenon of figure 1 (where E” > E* > 
0) or dp/dr > 0 in conjunction with the differential trade-pattern 
phenomenon of figure 2 (where E” < 0 < E*).® Correspondingly, if 
home exportables were relatively intensive in their use of capital 
(rather than labor), a transfer-induced deterioration in national 


7 Alternatively, if eq. (7’) from n. 6 above were substituted into eqq. (6), simple 
manipulation could show that 


dW4/dr = [(e + e*) + (1 — y)(mf — m/e + e* — 1) (8') 
and 
dWldz = [(e + e*) + (1 — y)@nf + m* — 1)]/e + e* — 1), (9’) 


where mf denotes the marginal propensity to consume the home importable for foreign- 
ers within the home country; y = (C? — X%)/(C¢ — X9) = E"/E%; and use is made of the 
fact that m = ym" + (1 — y)m’. Eq. (8’) indicates that dW? /dr can be decomposed into twa 
comparative-static components. As could be shown readily, the first component [(e€ + 
e*)/(e + e* — 1)] is the transfer-induced change in W* that would occur initially if the 
transfer were given temporarily to nationals and domestically located foreigners in the 
respective amounts yr and (1 — y)r, whereas the second component [(1 — y)(m/ — m”)/¢ 
+ e* — ])] is the subsequent change in W? that would occur as the portion (1 — y)r was 
passed from domestically located foreigners to nationals (the ultimate recipients of the 
entire transfer). Eq. (9') could be interpreted analogously, since —dW"/dr equals the 
worldwide sum of transfer-induced changes in welfare for everyone excluding home- 
country nationals, as could be shown readily. 

* Under the present assumption that mf = m" = m, eq. (8) of n. 7 above is equivalent 
to eq. (8), while eq. (9’) leads directly to condition (9). Alternatively, if it were the case 
that m/ # m”, there would arise the new possibility of having dW4/dr < 0 in eq. (8'). Also 
if it were supposed that mf = 1 — m*, it would be the case that dW*/dr > 0 unambigu- 
ously in eq. (9’). This last result can be understood intuitively as follows: If foreign 
tastes are uniform throughout the world, the reasoning behind eq. (8) shows equally 
well that the transfer must lower worldwide foreign welfare; that is, —-dW"/dr < 0, 
recalling n. 7 above. Incidentally, in view of the fact that worldwide foreign welfare 
otherwise can rise (when dW"/dr < 0) if mf # m*, international aid might be especially 
attractive for a donor country with investments in the aid-receiving economy. 

® Although (E” — E*)dp/dr < 0 also if 0 < E” <E* when dp/dr > 0, dW"/dr > 0 in this 
case, as implied by eqq. (6). The reader may also see alternatively that, from condition 
(9), dW"/dz < 0 if and only if (e + e*)E" + (E* — E")(e + e* — 1) < 0. Therefore, national 
welfare may decline despite the increase in aggregate welfare if E" < 0 < E® (i.e., the 
differential trade-pattern phenomenon holds) or if E" > E* > 0 (i.e., the differential 
trade-volume phenomenon holds). 
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(though not in aggregate) welfare would still be possible, provided 
that either the aggregate terms of trade improve in the case of labor- 
abundant nationals or an aggregate terms-of-trade decline occurs in 
the presence of capital-abundant nationals. 

Consequently, the basic results of this section can be summarized 
generally in the following terms. When the home exportable uses 
intensively the factor that is relatively abundant in the national (as 
compared with the aggregate) endowment, the national and aggre- 
gate patterns of trade are the same, in which case a fall in national 
welfare might occur through a differential trade-volume phenome- 
non if the (national and aggregate) terms of trade worsen unambigu- 
ously. Alternatively, when the home exportable uses intensively the 
factor that is relatively scarce in the national (as compared with the 
aggregate) endowment, the aggregate and national patterns of trade 
could differ, in which case the differential trade-pattern phenomenon 
might give rise to a deterioration in national welfare if the national 
terms of trade worsen through an aggregate terms-of-trade im- 
provement. These general results, moreover, hold equally well for 
changes in p induced by economic expansion and tariff policy, as will 
be clear from the analysis below. 


IV. Economic Expansion 


As Bhagwati (1958a) has demonstrated, a once-for-all increase in a 
factor endowment or in a technological level might deteriorate the 
aggregate terms of trade enough to worsen aggregate welfare of the 
home country, but this immiserizing growth can occur only if either 
the rest of the world has an inelastic offer curve or growth would 
decrease the production of home importables at the initial product- 
price ratio. In other words, if the offer-curve elasticity for the rest of 
the world is not less than unitary and economic expansion is not 
“ultrabiased” against the production of home importables, the 
growth-induced change in W* cannot be negative. Even under these 
circumstances (assumed throughout the present section) which pre- 
clude a fall in aggregate welfare, however, the following analysis 
demonstrates that a (large) country might suffer a loss in national 
welfare. This demonstration of a growth-induced decline in W”, 
moreover, does not even require a rise in the relative price of home 
importables. 

To allow for factor-endowment expansion or technological ad- 
vance, equations (1) may be rewritten as follows: 


Xi,= Fi(p, 0),2 = lL; 2,j7 =a,n, (10) 


where @ is a general shift parameter, a rise in which indicates either a 
factor-endowment increase (for K’ or L*) or a _ disembodied 
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technological improvement for an industry (one or two). It is assumed 
that any addition to the aggregate supply of capital or labor is owned 
fully by nationals. If any part of such addition were foreign owned, 
the likelihood of a decline in W” would simply be enhanced, thereby 
strengthening the argument below. However, the ability of domesti- 
cally located producers to take advantage of disembodied technologi- 
cal progress should be independent of the source of ownership of the 
inputs used, as assumed here. 

Differentiating equations (2), (3), (4), and (10) totally with respect to 
6, while recalling that Ui, = 1 initially and that U}/U{, = p = —(6F 4/dp)/ 
(OF4/dp), we readily obtain the following result: 


dWildo = Yi + (Eidpld@),j =a,n, (11) 


where Yi, = 0Y4/00 > 0. Thus, each dW‘/d6 is the sum of a primary 
growth effect (¥4) plus a secondary relative-price effect (Eidp/d 6), 
which are analogous to the welfare-related effects of the transfer 
mentioned above in Section III. Although national and aggregate 
welfare again would remain equal if foreign inputs were entirely 
absent from the home country, the actual presence of foreign 
ownership gives rise to the possibility of having dW"/d@ < 0 when 
dW/d@ > 0, except in the special case where dp/d@ = 0. 

Turning to the standard growth-induced terms-of-trade response, 
analyzed previously by Bhagwati (1958)) and subsequently by Kemp 
(1969, p. 110), we see that it is a well-known fact that 


dpld@ = (8 — m)Y$/(e + e* — 1)E%, (12) 


where B = (0X4/060)/Y$. When this result is substituted into equations 
(11), straightforward manipulation confirms that 


dW4/d0 = (e +B +e* — DYGV/eE+e*%—1)>0 (13) 
but shows that 
dW/d0 SO asE“e + e* — 1)¥3 SE%(m — B)Y4, (14) 


where B 2 0, which recalls the assumption that growth would not 
reduce production of home importables at the initial commodity- 
price ratio; and e* 2 1, which recalls the assumption that the rest of 
the world’s offer curve is not inelastic. Thus, despite the fact that 
dW*/d@ > 0 unambiguously under these circumstances, it is still possi- 
ble to have dW"/d6 < 0 nevertheless.'° 


© Along lines suggested by nn. 6-8 for the case of international transfer, the analysis 
of economic expansion could be extended readily to distinguish between m” and m¢. It is 
worth noting, however, that it would still be possible to have dW"/d6 < 0 even if it were 
the case that mf = 1 — m*. 
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Equations (11) imply that a fall in national despite the rise in 
aggregate welfare can occur only if (E” — E*)dp/d@ < Y$ — Y3. As could 
be shown readily, this necessary condition for a fall in W” can result 
from each of the following alternative events, for example: an in- 
crease in either the national endowment of capital or the technologi- 
cal level of industry one, with dp/d@ > 0 in the presence of the 
differential trade-pattern phenomenon (fig. 2); and an increase in 
either the national endowment of both factors or the technological 
level of both industries, if dp/d@ < 0 with the differential trade- 
volume phenomenon (fig. 1).!! Correspondingly, if home exportables 
were relatively capital intensive, it would be possible to have an 
expansion-induced deterioration in national (though not in aggre- 
gate) welfare under a variety of circumstances, including the follow- 
ing: an increase either in the national stock of labor or in the level of 
technology for the production of importables, when the national 
endowment is labor abundant; or an increase either in the national 
endowment of both factors or in the level of technology for both 
sectors, when nationals are capital abundant. 


V. Tariff Policy 


According to a standard result in the literature (see Bhagwati 1968), 
free trade is ranked superior to both no trade and subsidized trade 
(assuming that both offer curves are well behaved),’” from the view- 
point of aggregate welfare. In other words, the home country cannot 
increase W? above the free-trade level either by using an import (or 
export) tax to eliminate trade or by imposing an export (or import) 
subsidy to encourage trade. From the national-welfare point of view, 
however, the ranking above may be reversed. Since Bhagwati and 
Brecher (1980) already demonstrated the possibility of such a reversal 
for free trade versus autarky, the following analysis concentrates on 
free versus subsidized trade. 
To allow for tariff policy, equations (2) may be modified as follows: 


Yi =X} + pXi+ [(Cf - Xf) al(1 — @)],j = an; (15) 


where a denotes the ad valorem tariff, which is an import tax (if a > 0) 
or an import subsidy (if a < 0); the domestic relative price of the 


11 Although Y$ = Y with a national factor-endowment increase, it is possible that Y4 > 
Y4 for a technological advance. Thus, with, the latter (but not the former) type of 
economic expansion, a fall in national welfare despite the rise in aggregate welfare 
might occur even without the differential trade-volume and differential trade-pattern 
phenomena—if both industries experience the technological advance and dpidé < 0. 

127 e., the offer curve is assumed to represent imports as a monotonic decreasing 
function of their relative price in world markets. For the significance of this assumption 
in tariff analysis, see Bhagwati and Kemp (1969). 


508 JOURNAL OF POLITICAL ECONOMY 


second good is still denoted by p, so that the relative price of this good 
in world markets is now equal to p(1 + a); and (C¢ — X%)a/(1 + a) 
equals the real value (in terms of the first good) of tax revenues or 
subsidy payments, evaluated at domestic prices.'* In writing equations 
(15), it is assumed (for the sake of simplicity) that all tax revenues or 
subsidy payments, respectively, are returned to or collected from 
national consumers in lump-sum fashion. If foreigners within the 
home country were to receive or finance any part of these revenues or 
payments, respectively, comparison of the free-trade and autarkic 
equilibria (which generate no tax revenues) clearly would be unaf- 
fected, while the chances of having free trade inferior to subsidized 
trade simply would be enhanced (thereby strengthening the analysis 
below). 

To show that free trade might be inferior to subsidized trade from 
the national point of view, it is sufficient to establish the possibility of 
having dW"/da < 0 in free-trade equilibrium. Consequently, through- 
out the following discussion, let a = 0 in the initial (pretariff) 
equilibrium. 

Differentiating equations (1), (3), (4), and (15) totally with respect to 
a and again recalling that U, = 1 initially and that U}/U, = p = 
— (dFi/dp)/(aFi/dp), we may verify readily that 


dW? /da = pE* + (E’dp/da),) = a,n; (16) 


note that pE* = CY — X¢ when (balanced) trade is initially free (with 
a = 0). By well-known reasoning (see Kemp 1969, p. 96), 


apida = pl =m =e. \e ae 1). (C7) 


note that 1 — m equals the home country’s marginal propensity to 
consume the exportable and recall that a = 0 initially. When this 


result is substituted into equations (16), simple manipulation confirms 
that'* 


daWda = epEt(e + e* — 1) 20 (18) 
but shows that 
dWda = 0 as (e + e* — 1)E* S (m + e* — 1)E™. (19) 


Thus, despite the fact that dW“/da 2 0, it is evidently possible to have 
dW"/da < 0 nevertheless.'® 


13 Although the corresponding value at world prices would be (C¢ — X%)a@, consumers 
respond directly to domestic (tariff-inclusive) prices instead. 

* Note that dW%/da = 0 only in the small-country case where e* = », and even then 
the change in W* does not equal zero for any discrete change in a, by well-known 
reasoning. 

* Along lines suggested by nn. 6-8, the analysis of tariff policy could be extended 
ee) to a m’ # m”, without eliminating the possibility of having dW"/da < 0 even if 
ie = he 
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As implied by the equations (16), it is possible to have dW"/da < 0 
(even though dW%da cannot be negative) if either a differential 
trade-volume phenomenon arises (fig. 1) when dp/da < 0 (the “nor- 
mal” price response) or a differential trade-pattern phenomenon 
occurs (fig. 2) when dp/da > 0. (The “perverse” price response [dp/da 
> 0] can occur only in the large-country case, under conditions 
discussed by Metzler [1949].) Correspondingly, if home exportables 
were relatively capital intensive, it would be possible to have dW"/da 
< 0 (even though dW%/da cannot be negative) if either nationals are 
labor abundant when dp/da > 0 or nationals are capital abundant 
when dp/da < 0. 

Thus a trade subsidy might raise national (but not aggregate) wel- 
fare above the free-trade level. This analysis of a small subsidy (tax) 
on trade, moreover, complements the discussion of Bhagwati and 
Brecher (1980), who concentrate on prohibitive taxes on trade and 
thus are able to avoid the issue of tariff revenues. 

The analysis of this section has an important implication for the 
traditional method of estimating the cost (benefit) of tariff protection 
or trade liberalization. Since the conventional method (as outlined by 
Johnson [1960]) ignores the source of ownership of domestically 
located inputs, the concept measured (in present notation) is clearly 
dW4/da rather than dW"da. Thus, the traditional estimate of the 
impact of protection or liberalization is an aggregate measure, which 
overstates or understates the national cost (benefit) if (E" — E“)dp/da 
S 0, respectively, as suggested by equations (16). This misstatement 
arises because the conventional estimate simply sums the three stan- 
dard components (namely, the external terms-of-trade effect and the 
costs of distortion in both production and consumption), while it fails 
to exclude the foreign-factor portion of the tariff-induced change in 
aggregate welfare.'® 


VI. Summary 


As demonstrated by this paper, welfare aspects of international trade 
theory need to be reconsidered, when national and aggregate income 


16 The foreign-factor portion of the change in aggregate welfare is represented by 
the expression (E¢ — E")dp/da, which must be excluded from dW“/da to give dW"/da, as 
suggested by eqq. (16). Also by repeating the procedure of Bhagwati, Ramaswami, and 
Srinivasan (1969), it is possible to write (in present notation) that dW%/da = (E“dp*/ 
da) + [(p* — p)dX$/da] + [(p — p*)dC$/da], where the relative price of the second good 
in world markets is denoted p*, which equals p(1 + a). The components E“dp*/da, (p* — 
p)dX$/da, and (p — p*)dC{/da are the effects due to the terms-of-trade change, the 
production distortion, and the consumption distortion, respectively. When evaluated in 
free-trade equilibrium (where a = 0 and p = p*), the latter two (distortion-related) 
components disappear, leaving only the first (terms-of-trade) component. This re- 
maining (first) component, moreover, is equivalent to the right-hand side of eqq. (16) 
for j = a, since (in free-trade equilibrium) dp/da = (dp*/da) — p. 
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differ in the presence of foreign ownership. Examples of this need 
are provided by the analysis of international transfer, economic ex- 
pansion, and tariff policy. For a country receiving a transfer from 
abroad, national (but not aggregate) welfare might deteriorate even 
when international commodity-market equilibrium is stable, regard- 
less of the direction of change in the world product-price ratio. In the 
case of economic expansion from factor-supply growth or technologi- 
cal advance, national (but not aggregate) welfare might worsen even 
when the rest of the world does not have an inelastic offer curve 
and domestic expansion is not ultrabiased against production of 
home importables, no matter what the direction of change in the 
world commodity-price ratio. As for tariff policy, free trade might 
be ranked inferior to both no trade and subsidized trade (in either 
direction), from the viewpoint of national (but not aggregate) welfare. 
Moreover, the conventional empirical estimates of the cost of protec- 
tion a la Johnson’s (1960) methodology are generally seen to be in 
need of correction if the economy has foreign-owned factors of pro- 
duction. In fact, many economies typically do have substantial labor 
inflows under gastarbeiter or other programs defined by immigration- 
quota policies, and, of course, equally there are substantial flows of 
capital among nation states." 
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Accounting for Price Changes: 
American Steel Rails, 1879-1910 





Robert C. Allen 


University of British Columbia 


A framework is developed for decomposing product price changes 
into changes in input prices, technical efficiency, and deviations of 
price from unit cost. This framework facilitates the measurement 
of productivity growth in noncompetitive industries. The history of 
American steel rail prices between 1879 and 1910 is analyzed, and it 
is concluded (in contrast with much recent work) that productivity 
growth remained rapid until the twentieth century and that the steel 
industry was sufficiently collusive so that the rail producers received 
the benefits of that productivity growth as excess profits. 


I 


The 1890s mark a pronounced change in the position of the Ameri- 
can steel industry in the world market. Before the 1890s American 
prices substantially exceeded British prices, and the American indus- 
try achieved a large size only because of high tariffs. During the 1890s 
American prices dropped to British levels or below, and America 
emerged as a major exporter of iron and steel. This change in price is 
apparent in the case of rails, which was one of the leading steel 
products of the time. Between 1881 and 1890 the average price of 
steel rails at Pennsylvania mills was $37.01 while the average British 
price was $23.62. During the period 1906-13 the American price had 
fallen to $28.00 while the British price had risen to $29.46 (Allen 
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1979). To explain the emergence of the United States as a steel 
exporter, it is necessary to explain the fall in American steel prices.! It 
is the aim of this paper to account for the decline in the price of 
Bessemer steel rails. 

There is no shortage of possible explanatory factors. During the 
late nineteenth century low-cost iron ore fields—particularly the 
Mesabi in Minnesota—were exploited, and improvements in lake 
shipping services were effected. Both developments substantially low- 
ered the cost of iron ore in the Midwest and hence the cost of 
Bessemer pig iron, which was the principal input in steel rails. 
Moreover, many observers at the time believed the rate of productiv- 
ity growth to have been very high in the American steel industry. This 
development would also be expected to have lowered the price of 
American steel. On the other hand, there were several attempts on 
the part of producers to collude and thus raise the price of steel rails. 
With the formation of U.S. Steel in 1901 the difficulty of enforcing 
collusive agreements was reduced. In the case of steel rails there is 
prima facie evidence that producers subsequently exercised monop- 
oly power. Prior to 1901, the price of steel rails had changed fre- 
quently. In that year, however, the price of steel rails was set at $28 
per ton, and it remained unchanged until the First World War. Such 
stability was unprecedented in the industry and indicates some power 
to control the price but does not in itself prove that price was set above 
cost (Temin 1964, pp. 192, 284-85). 

The procedures that economic historians have used to analyze price 
changes have not been fully satisfactory. The technique most firmly 
rooted in theory is the application of an identity derived by Jorgenson 
(1966, pp. 2-4). The identity asserts that the rate of change of the 
price of the product equals an index of the rates of change of input 
prices minus a residual which is usually interpreted as the rate of 
technical progress. McCloskey (1973, p. 99) used a relative levels 
version of this identity to argue that—since the ratio of the price of 
rails to the price of pig iron (which was regarded as a proxy for the 
index of input prices) was trendless after the 1880s—there was no 
productivity growth after that date? The difficulty with this proce- 
dure is that the identification of the residual with technical progress is 
correct only if the industry is in competitive long-run equilibrium. 
This assumption was barely mentioned in Jorgenson’s discussion of 
the identity,? and, indeed, it is not immediately obvious from the 


1 It is also necessary to explain the rise in British prices. This aspect of the problem is 
considered in Allen (1979). te 

2 Temin (1964, pp. 218-19) develops a similar argument. 

3 The discussion is confined to Jorgenson (1966, p. 4, n. 1). 
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mathematics that the assumption has been invoked. However, a 
scheme which decomposes price changes into efficiency and input 
price effects only if the industry is in competitive long-run equilibrium 
is not a satisfactory framework for analyzing American steel prices in 
the late nineteenth and early twentieth centuries. 

Reflection suggests that changes in market power influence the 
difference between input and output prices. This realization has also 
been exploited by economic historians analyzing the evolution of steel 
prices. Thus McCloskey (1973, pp. 24-28) examined the ratio of rail 
to pig iron prices in order to investigate the effectiveness of various 
attempts at collusion by rail producers in Britain. Temin (1964, pp. 
186-92) undertook similar investigations for the United States. 
Clearly, changes in monopoly power and technical progress cannot 
both be inferred from the ratio of output to input prices. At best, only 
their combined effect can be identified. 

What is required to account for the fall in the price of American 
steel is an accounting scheme that identifies the separate effects of 
input price changes, efficiency changes, and changes in the deviation 
of price from unit cost. Such an accounting scheme is developed in 
Section II for a single product firm (an analogous scheme for a 
multiproduct firm is developed in an appendix).* In Section III the 
scheme is used to account for the fall in the price of steel rails between 
1879 and 1910. The data discussed there indicate that declines in 
input prices and increases in efficiency both exerted powerful down- 
ward pressures. Indeed, the rate of technical progress was at its 
highest in the 1890s. The greater effectiveness of collusion after the 
U.S. Steel merger, however, meant that these cost declines were not 
entirely reflected in lower prices. Indeed, profit margins were raised 
enough between 1889 and 1902 so that rail producers realized most 
all of the gains of technical progress as higher profits. In Section IV 
these conclusions are compared with McCloskey’s and Temin’s 
findings. Jorgenson’s price accounting identity is examined in detail 
in order to make explicit the necessity of assuming competitive long- 
run equilibrium if one tries to infer the rate of productivity change 
from differences in the rates of change of input and product prices. It 
is also shown that substantial errors result if this assumption is er- 
roneously maintained. 


4 The author has extended the price accounting framework discussed here to the case 
of a multiple output firm. The multiple output case is developed in Allen (1978, pp. 
7-11) and in an appendix to this paper which is available from the author upon 
request. 
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To develop the price accounting scheme for a single product firm, it is 
assumed that the firm minimizes costs subject to the following pro- 
duction function: 


Qd) =A - fix], (1) 


in which Q(¢) is output at time ¢, X(t) = [X,(), X,(), . . . , X,(¢)] is the 
vector of inputs used at time ¢, and A() is an index of Hicks neutral 
technical progress. The assumption of Hicks neutrality can easily be 
relaxed (Diewert 1980, pp. 489-95). The function f is assumed to be 
linearly homogeneous. If f were known, the relative increase in effi- 
ciency between ¢, and t, could be computed from Q(¢,), Q(tz), X(,), and 
<e): 





Al) _ __ Q(V/QU) 5 
A) FXG 


The term Q(t,)/Q(t,) is the relative increase in output that actually 
occurred. The term /[X(t)/f[X(t,)] = A) + f[IX(t.) VAG) - fIX@)] is 
the relative increase in output that would have occurred in the ab- 
sence of technical change but given the change in input quantities that 
actually occurred. Thus A (t,)/A(t,) equals the proportional increase in 
output that is not attributable to the increase in inputs. 

If the firm minimizes cost, then there exists a total cost function C 
which is factorable into a unit cost function c* multiplied by the 
production rate. Under the assumption that technical progress is 
Hicks neutral, the unit cost function can be written as a function of 
the input prices divided by A(¢): 


Clt,W(t), QM] = c*[t4W@)] - Q@) = LW] - Q@/A(). (3) 


Here W(t) = [W,(¢), W(t), ..., W;,(t)] is the vector of k input prices. It is 
presumed that the supply to the firm of X(¢) is perfectly elastic at W(t). 
The right-hand equality implies that the relative change in unit costs 
can be decomposed into an efficiency component and an input price 
component: 

C¥ [to Wte)) _ Ath) , clW(ty)I 


lt, W)) Ads)  clW,)I ) 





The term A(¢,)/A(¢,) is relative efficiency in the two periods and 
clW (tp) Vc[W,)] is the relative increase in unit cost that would have 
obtained had there been no productivity growth but given the actual 
change in input prices. Equation (4) provides a second method for 
measuring productivity growth. Ifc were known, then cLW (tz) \/c[W (t,)] 
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could be computed by direct substitution. Dividing that result by the 
observed change in unit costs would yield A(t,)/A(t,). From the per- 
spective of equation (4), the increase in efficiency equals the relative 
decline in unit costs not attributable to declining input prices. 

The price of the product will always be related to unit costs by the 
following identity: 


P(t) =M@- c*tWOI, (5) 


where M(t) is the implicitly defined markup of price over cost. Fora 
competitive industry in long-run equilibrium, M(t) = 1. One would 
expect to observe M (¢) usually not equal to one. The relative change in 
price between ¢, and ¢, can be written as 


P(tz) _ M(ts) , c*[ty,W (te) 


Pi) “MG? ehW@) (6) 








Equation (6) could be used to decompose a change in price into a cost 
change and a markup change. Equations (2) and (4) provide a basis 
for decomposing the cost change into input price and efficiency 
components. 

If c and f were known, equations (2), (4), and (6) could be used to 
decompose price changes into markup, efficiency, and input price 
changes. In an empirical study, of course, the functional forms of c 
and f would have to be specified and the parameters of the function 
estimated. Prodigious task! Recent work in index number theory (e.g., 
Diewert 1976, 1980) has shown how such elaborate undertakings can 
be circumvented. Corresponding to each of the functional forms that 
would likely be used to specify c or f are one or more index number 
formulae. The indexed input quantities (or input prices) would indi- 
cate exactly f[X (tz) Vf [X(¢,)] or c[W (@,) /c[W (,)] on the assumption that f 
or c had the functional form that corresponded to the index number 
formula used. For example, f[X (t,)|/f[X(t,)] would equal a Laspeyres 
or Paasche index of the input quantities if f were Leontief. If f were 
Cobb-Douglas, f[X (t2)]/f[X (t;)] would equal a geometric index of input 
quantities. Iff were a quadratic function, f [X (t,) /f[X (¢,)] would equal a 
Fisher ideal index, and if f were translog, then f[X(,)/f[X(¢,)] would 
equal a Térnqvist index of the input quantities.? Analogous results 


5 These equivalences are discussed in Diewert (1980, pp. 446-52). The linearly 
homogeneous translog function, which is used extensively in this paper, is 


k k k 
In fIX@)] = a» + > Bln Xe) + 1/2 > Aen de Xin X20), 


subject to the restrictions: 2B inl Grn tea l> a3 eS ORL OL nite aetna Ce 
Diewert (1976) has shown that 
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hold forc and input price indices. It should be noted that in the index 
number approach to the measurement of f[X (ty) ]/fLX (t,)] or c[W (t2)]/ 
c[W(t,)] it is not necessary to know the values of the parameters of 
f orc. Instead, the index number formulae require the values of the 
factor shares, estimates of which are frequently easier to obtain than 
estimates of the parameters of f or c. 

In an empirical study, one should choose the formula for indexing 
input prices or quantities that corresponds to the true structure of c 
and f. In the absence of any prior information on the true structure, 
the choice of an index number formula must be made on the basis of 
which possible specification of f or c would best approximate the true 
structure, whatever it is. Since Leontief and Cobb-Douglas functions 
can be regarded as first-order approximations to f or c, while qua- 
dratic and translog functions can be regarded as second-order approxi- 
mations, one would expect the quadratic and translog functions to fit 
the data more closely. Hence, there are grounds for preferring the 
Torngqvist and the Fisher ideal indices to the Laspeyres, Paasche, or 
geometric. Since the Fisher ideal and Tornqvist indices are exact for 
functions that can be regarded as second-order approximations to 
any linearly homogeneous production, cost, or expenditure function, 
Diewert (1976) has termed the Tornqvist and Fisher ideal indices 
“superlative.” In the remainder of this paper, translog functions and 
Tornqvist indices will be used. 

There are two ways to proceed, depending on whether one mea- 
sures A (t.)/A(t,) with equation (2) or (4). If one uses equation (2), one 
presumes f is translog, so f[X (tz) /f [X(t,)] equals a Tornqvist index of 
inputs. Hence, 


A(t) ___ QWiQs) _ pee I i 
A) A Xa) Qt IX at) 
La 
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Consistency requires that c[W(¢,)/c[W(¢,)] be measured implicitly by 
substituting A (t,)/A(t,) from equation (7) into equation (5): 


k 


c — Clb Wy)l , (ty) /Xj (ty) fi 
C[W (tz), W(t,)] c*[t,,W (t,)] i ean . (8) 
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which is a Tornqvist index of X(t). Here, s,(¢) is the share in costs of X,(t). 
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Rearranging equation (8) indicates that the change in unit costs can be 
expressed as 


k rt 
[td] _ QUteVXilt) Fe ; 
or eo er Il OOK | ) 





Substituting equations (7) and (9) into equation (6) provides one 
scheme for decomposing product price changes into input, efficiency, 
and markup components: 








P(t) _ Mi), AQ) . awe, We)). (10) 
P(t,) M(t) A) cw hie 
If one uses equation (4) to measure productivity growth, a slightly 
different accounting scheme emerges. In this case, one presumes ¢ is 
translog, so c[W (t,) VclW (t,)] equals a Tornqvist index of input prices: 





UAC) ae te y (11) 
c(Wt)] gL Wilts) 
Equation (4) can then be used to implicitly define an efficiency index: 
; “Tf Wilts) Fe /c*lte, W(t) 
= ‘ : : 1 
att) = ariey| ‘eure wa 


Analogously to equation (9), one can decompose unit cost changes as: 
k 


c*[t,,.W(te)) _ W i(te) Ii_,- 
c*[t,.WG)) _ Uhre /A (ty tx). (13) 





Substituting equation (13) into equation (6) provides the second price 
accounting scheme: 


P(ts) _ M(te) TT [ Wilta) F 7 
P(t.) M(t) 4 aa ‘A(las). ! 








Equations (10) and (14) are each internally consistent but differ in 
the presumptions they make about the way the data were generated. 
In equation (10) the presumption is that the data were generated by 
minimizing costs subject to a translog production function. In equa- 
tion (14) the presumption is that costs were minimized subject to the 
production function which generates a translog cost function. Since a 
translog production function does not yield a translog cost function 
(and conversely), different accounting schemes follow from the dif- 
ferent assumptions. However, since translog production and cost 
functions both provide second-order approximations to the technol- 
ogy, one would expect the two accounting schemes to give similar 
results. That, indeed, is true in the case of steel rails. 
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Table 1 presents details of the costs of producing steel rails in the 
United States in 1879, 1889, 1902, and 1910. The figures for the 
latter two years are derived from the U.S. Commissioner of Corpo- 
rations (1913) investigation into costs, prices, and profits in the steel 
industry. The commissioner obtained and published details of the 
cost of producing most steel products made in the United States 
between 1902 and 1910. The figures for 1902 are averages for firms 
producing virtually all of the heavy steel rails made in America in that 
year. The 1910 figures are averages for all U.S. Steel’s Bessemer rail 
mills. These mills produced 60.2 percent of American production in 
that year. The cost figures for 1901 and 1903-9 that the bureau 
reports are quite similar to the 1902 and 1910 figures, so conclusions 
founded on those years are indicative of conditions throughout the 
first decade following the U.S. Steel merger.® 

The cost figures for 1879 and 1889 are less certain than those for 
1902 and 1910. The 1889 figures are based on the U.S. Commissioner 
of Labor’s (1891) survey of steelmaking costs in the United States, 
Great Britain, and the European continent. Although the cost esti- 
mates were based on the cost accounts of steel firms, the coverage was 
not as broad as the U.S. Bureau of Corporations survey. Moreover, 
the U.S. Commissioner of Labor (1891, p. 163) Report expressed 
concern that some of the leading American rail mills had refused to 
supply cost information. The 1879 figures are also from an interna- 
tional comparison of production costs. This survey was conducted by 
Alexander L. Holley, who was the leading American steelworks en- 
gineer. The survey was published privately by the Bessemer Associa- 
tion, which owned the American rights to the Bessemer patents, for 
the use of its licensees. Holley visited several European plants in 1879 
and reported their costs. For comparison, he provided “representa- 
tive costs” in the United States. It is the latter costs which are used in 
table 1. Given Holley’s knowledge, the audience he was writing for, 
and the confidential nature of the communication, one presumes the 
American cost figures are indeed representative.’ 


6 See U.S. Commissioner of Corporations (1913, pp. 209-10, 461) for 1910 rail 
production figures and pp. 143-49, 209-16, 346-68 for information on costs in the 
other years. 

7 It should be noted that none of these sources reports the quantity and price of all 
inputs consumed. That practice was followed in the case of metallic inputs, but in the 
cases of fuel, labor, miscellaneous materials, and capital, only the expense per ton was 
recorded. It was necessary to estimate input prices and implicitly input quantities from 
other sources. Those sources are discussed in the notes to table 1. It must also be 
stressed that all calculations reported in this paper assume that the costs in table | are 
minimum long-run average total costs. 
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Between the 1880s and early 1900s the American steel industry 
became internationally competitive because the price of steel in the 
United States fell sharply. Table 1 and tables 2 and 3, which report 
the index numbers discussed in the previous section, make clear the 
decline in price was associated with a sharp decline in steelmaking 
costs. According to table 2, steelmaking costs in 1889 equaled 77 
percent of costs in 1879. In 1902 costs were 81 percent of their 1889 
level. The decline in costs subsequently slowed, for costs in 1910 
equalled 97 percent of their 1902 level. 

Equations (9) and (13) provide schemes for decomposing the cost 
changes into efficiency and input price components. The results from 
using both equations are exhibited in table 2, and, as the reader can 
verify, the results are nearly identical. Between 1879 and 1889 falling 
input prices were the main cause of falling costs. Thereafter input 
prices rose slowly. Between 1889 and 1902 costs continued to fall 
because productivity growth was rapid. After 1902, the rate of pro- 
ductivity growth slackened and input prices rose, so the decrease in 
costs between 1902 and 1910 was meager. 

Equation (7) indicates that productivity increased at the rate of 0.6 
percent per year between 1879 and 1889. Between 1889 and 1902 
the rate increased to 1.6 percent and then declined to 0.8 percent in 
the period 1902-10. (Eq. [12] gives essentially the same results.) 
Throughout the period 1879-1910, the average product of labor 
continuously rose, and the average product of capital generally fell. 
Since the ratio of the wage rate to the rental price of capital consis- 
tently increased, the change in the average products of labor and 
capital might have been due to factor substitution. Moreover, recent 
econometric work suggests that technical change was labor augment- 
ing in this period (Cain and Paterson 1979). Technical change and 
factor substitution reinforced each other in determining the labor- 
capital mix. 

Productivity growth was highest between 1889 and 1902. What 
distinguished those years from the others was the marked rise in the 
average products of fuel and metallic inputs. Fuel consumption de- 
clined as the mixer (invented in 1889) was adopted by American 
steelworks. In earlier years, it had been impossible to use the molten 
pig iron from blast furnaces directly in Bessemer converters. The pig 
iron was cast and then remelted. By 1902 most Bessemer steel was 
made from molten iron conveyed directly from the blast furnace. The 
saving in fuel was impressive. The productivity of metallic inputs also 
increased due to developments in other areas of the steel mill, in this 
case due to the growth of open hearth steelmaking capacity. Bessemer 
shops and rolling mills generated a large volume of scrap in the form 
of poor rolls and sheared ends. These could not be recycled in 


‘sa[qeuea 1ayI0 Jo suoMtuYap ay) 10F Z aqui 9ag “read eNTUT ay UT aotId ay Or potiad aup jo seat yeuTULIA ay UT aoLId ay Jo Ones ay) St ("A)q/(7)d ‘1 PIE L—AIANOS 





























(GEGQ i ce (GPG) se (19G0'1) = = 000'I (SL40'1) -  (40g6)- (19201) — = 000'I 01-2061 
GHODEes sG66l)--" CLL =. “Sice: (G1Z0'1) w (2065) a0 SLE ee SLC: 2061-6881 
(ZL18°) - (686°) (08¥6') = 083L" (8618") mr (2666) == (0876) =n 08ce: 68-6L81 
£(%9) a1)? C3) Ves CONG oul Mand em ya ae. CDi eee aordag 
(aan | I @w | a Lona’) M2 My yw @)d 

(pF) OF Ag (01) ‘Oa 4d 








SHONVH) AM ONISOdWOOAG 


6 AIdVL 


(ZI) “ba 07 Burpsos98e (17%7) p puke ‘(T1) ba o9 Surpsosoe [("7) 41] /[(1) M1]? ‘(g) ‘ba 01 Burpsosze [("7) a4‘) M)2 ‘(L) “ba or 
Surpaoz.e painduros sem (?7) p7/("2) p 4ead PeNTUT ay? UT soo UN paatasgo 4q paptarp potiad ay} Jo wad [eUTUTIAI ay) UT (T 3]qQeI WOIZ) SOD WUN paasasgo st [(*7) A1°7] 42/72) M7]? 21 AGEL — aOUNg 




















(S#S0'1) : (6F36°) = 9FL6 (SL¥0'T) . (S06) = 9FL6 01-3061 
(FL10°1) . (F66L’) = €E18° (G1Z0'1) : (Z96L’) = GEIS" 2061-6881 
(ZL18°) . (L686) = 6L9L° (8618°) : (49€6°) ca 6L9L° 68-6L81 
L("9) 2 Cave Es (9) AN Ax? iene @)y _ (CD) M2 acraag 
[@) 12 I = (2) Mx DMC) MI? My [@) M7]x2 

(G1) ‘OF Ad (6) ‘OF 4d 








SHAONVH’) LSOT) LIN() DNISOdNOOAG 


6 ATAVL 


522 


ACCOUNTING FOR PRICE CHANGES 523 


Bessemer converters but could be remelted in open hearth furnaces. 
In 1889 only 11 percent of American steel was open hearth, but by 
1902 38 percent was open hearth (Temin 1964, pp. 272-73). The 
development of this process allowed the recycling of scrap. Tables 1 
and 2 follow the nineteenth-century accounting tradition of charging 
Bessemer rail mills for their net consumption of metallic inputs. The 
recycling of Bessemer scrap in open hearth plants thus shows up as a 
rise in the productivity of metallic inputs in Bessemer rail production. 

Table | indicates that the fall in production costs was not entirely 
passed on to consumers in lower prices. An apparent consequence of 
the U.S. Steel merger was to raise markups to the 30 percent range 
for at least the first decade after the merger. In 1879 and 1889 
markups had been much lower. 

Table 3 uses equations (10) and (14) to decompose price changes 
into markup, efficiency, and input price components. The price of 
rails in 1889 equalled 72.8 percent of the price in 1879. Some of the 
decline in price was due to a decline in the markup; however, most of 
the decline can be ascribed to falling costs. As indicated previously, 
declining input prices were the predominant cause of declining costs. 
The fall in input prices was also decisive for the decline in price. 

Between 1889 and 1902, costs declined 19 percent but rail prices 
declined only 4 percent. The discrepancy arose because markups rose 
18 percent. This result indicates a major consequence of the U.S. Steel 
merger. Steel prices in the early twentieth century were set at levels 
prevailing about 1890, and the steel producers absorbed all of the 
decline in costs (due to technical progress in the case of rails) as excess 
profits. 

Table 3 indicates that there was little change in price, costs, or the 
determinants of costs between 1902 and 1910. The steel industry was 
sufficiently collusive so that a markup of 30 percent over unit costs 
could be maintained for a decade. 


IV 


These conclusions regarding productivity growth and the market 
power exercised after the U.S. Steel merger are at variance with 
recent conclusions advanced by McCloskey (1973, p. 99) and Temin 
(1964, pp. 186-93, 218-20). Both noted that the ratio of the price of 
steel rails to the price of pig iron was trendless from the 1880s to the 
First World War. (This constancy is illustrated by the data in table 1.) 
Both inferred from these price data that productivity growth ceased 
in the 1880s, and Temin also inferred that market control after the 
U.S. Steel merger was not effective in raising the price of rails. 

To choose between McCloskey’s and Temin’s conclusions and those 
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advanced here, it is necessary to scrutinize the theoretical underpin- 
nings of the calculations employed. McCloskey’s procedure consists of 
the (approximate) application of an identity deduced by Jorgenson 
(1966, pp. 2-4). Although the assumption is not obviously apparent in 
the derivation, it is the case that this procedure only correctly mea- 
sures productivity growth (and, hence, accounts for price changes) for 
industries in competitive long-run equilibrium. This will be shown 
subsequently. It is worth considering the matter thoroughly, both to 
resolve the issue at hand and because Jorgenson’s identity has been 
widely used by economic historians to measure productivity growth 
and account for price changes.’ 

Jorgenson begins with the accounting identity that a firm’s revenues 
necessarily equal the total income earned by the firm’s inputs. In the 
case of a single product firm, 

k 


PO) AVG. (15) 
i=1 
where W*¥ are the accounting valuations of the inputs X;. Totally 
differentiating equation (15) with respect to time, dividing the result 
by equation (15), and regrouping terms gives: 


* k . k * * 
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Q » Sj X; a Sj W; P ° ( 6) 
Here the shares are s# = W#X,/.W#X;,. 
Jorgenson defined productivity to equal the ratio of output to total 
input: E = Q/X. Totally differentiating this expression with respect to 
time implies that the rate of change of efficiency equals the rate of 


change of output less the rate of change of total input: E/E = Q/Q — 
X/X. Defining X/X to equal ¥s#(X,/X;) implies 


Lge as : ae . es 





Jorgenson contended, moreover, that Solow (1957) deduced the 
left-hand equality from a production model. 
If correct, equation (17) has an important implication for produc- 


* North (1968, p. 954), Shepherd and Walton (1972, pp. 60-63), Harley (1973, p. 
376), and McCloskey (1973, p. 86) cite Jorgenson (1966). Sandberg (1974, p. 100) cites 
the thesis version of McCloskey (1973) for a proof of the equivalence of the G. T. Jones 
index of real cost and the right-hand side of eq. (16) of this paper and goes on to assert 
the equivalence of that index and Solow’s (1957) index. Most of these economic 
historians recognize that the identity is not a sound basis for measuring productivity 
change when long-run competitive equilibrium does not prevail, and a number of ad 
hoc adjustments are developed to deal with the problem. Lindert and Trace (1971) do 
develop productivity indices that exclude excess profits. 
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tivity measurement, namely, that productivity growth can be inferred 
from the growth of input prices less the growth in output price without 
any explicit account being taken of markups. This result justified McClos- 
key’s procedure and appears inconsistent with equations (10) and (14) 
in this paper. 

Equation (17) is an identity and hence always true. The difficulty 
with using it as the basis of productivity measurement is that E/E 
equals the rate of productivity growth in the Solow (1957) sense only 
if the firm (or industry) whose productivity growth is being measured 
is in long-run, competitive equilibrium. This condition was not made 
explicit in Jorgenson’s derivation. To see the importance of the com- 
petitive equilibrium assumption, recall Solow’s derivation. Solow 
(1957) assumes a production function of the form of equation (1) and 
identifies the rate of productivity growth with 4/A. Totally differ- 
entiating equation (1) with respect to time, dividing the result by 
equation (1), and rearranging allows A/A to be expressed as 


Ate OSORIO NOX) teeX, " 
hia tena ae ot) 


where the time symbol t is suppressed for simplicity. If the firm faces 
exogenous input prices and minimizes costs, then the necessary con- 





ditions for cost minimization can be rewritten as W/ZW;X; = (df! 
OX )/f (X). In that case equation (18) can be rewritten as 

AnG ‘On <P ee 

A O 2 Sj ee (19) 


where 5s; = W,X,/2W;,X;. 

Equation (19) and the left-hand equality of equation (17) are almost 
identical. They would be identical if s#* = s;. Since equation (17) is an 
accounting identity, one must make some assumptions about the 
institutional and accounting framework in order to determine if s* = 
5; Reasonable, simple assumptions are that inputs are available to the 
firm in infinitely elastic supply at prices W;,, any difference between 
revenues and the opportunity cost of inputs accrues as excess profit 
(or loss) to capital, and all inputs besides capital are valued in the 
firm’s accounts at their supply prices W;. Given these presumptions 
W* = W, for all inputs besides capital. For capital (which will be 
indexed as the Ath input), Wg = W,, if revenues exceed, equal, or fall 
short of the opportunity cost of the inputs. 

If the firm is in competitive, long-run equilibrium, revenues equal 
the opportunity cost of all the inputs and Wj = W,. In this case s* = 5;, 
and equation (17) correctly measures the rate of productivity growth 
A/A. Moreover, in this case, productivity growth can be inferred from 
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input and product price movements without taking markups into 
account. This result is consistent with the framework of Section II of 
this paper, for, if revenues always equal opportunity costs, M(t,) = 
M(t,) = 1 and the markup terms disappear from equation (14). Effi- 
ciency change can then be inferred from changes in product and 
input prices. 

Suppose, on the other hand, that the markup exceeds one. In that 
case revenues exceed opportunity costs, Wi > W,, and 2W#X; > 
SW,X,. Hence, sf > s, and s#* < s; for all inputs besides capital. Unless 
all inputs (including capital) grow at the same rate {s*(X;/Xi) # 
>s,(X,/X,) and the rate of productivity growth cannot be inferred from 
equation (17). The same conclusion follows if markups are less than 
one. Therefore, if markups do not equal one, equation (17) cannot be 
used to measure the rate of shift of the production function. In 
particular, one cannot infer the rate of productivity growth from 
differences in the rates of growth of input and product prices. 

Substantial errors result if equation (17) is used when markups do 
not equal one. The errors are apparent if equation (17) is applied to 
the data in table 1. That application requires that some discrete 
approximation be adopted. The most useful approximation is to 
approximate the time-varying shares s# by the average of the shares 
prevailing at times ¢, and fy, 5 = 1/2 [s#(¢,) + s#(é.)], and integrate the 
left-hand equality in equation (17) between ¢t, and ¢,. Exponentiating 
the integral yields: 

k 


El) _ (to /X (to) ¥* 
E(t) [| Qt V/Xi(t,) | (20) 
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If one treats equation (19) analogously, the result is equation (7). 
Again the only difference between the putative productivity measures 
is in the shares. Numerically, however, the differences are not incon- 
sequential. For the periods 1879-89, 1889-1902, and 1902-10 A(é,)/ 
A(t,) equals 1.0676, 1.2560, and 1.0747, respectively. For the same 
periods E(t,)/E(t,) equals 1.0408, 1.1904, and 1.1199. The term E(¢,)/ 
E(t,) understates the average annual rate of productivity growth by 40 
percent in 1879-89 and 24 percent in 1889-1902 and overstates the 
rate by 59 percent in 1902—10. Assuming long-run, competitive equi- 
librium when it does not obtain leads to serious errors in productivity 
measurement. 


Vv 


In this paper it has been shown that American rail prices fell in the 
late nineteenth century because production costs declined sharply. 
Two unexpected findings include the discovery that the rate of pro- 
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ductivity growth remained high into the twentieth century and the 
discovery that rail producers managed to maintain steel prices at a 
level 30 percent above average total cost for a decade after the U.S. 
Steel merger. Temin’s (1964, pp. 189-93) recent work on the merger 
concluded that the markets for rolled products remained competitive 
after the merger. That conclusion is certainly not true for rails. 
Similar evidence for other products suggests that markups were 
generally high.® A reassessment of the significance of the U.S. Steel 
merger in this regard seems in order. 

In order to establish these conclusions, it was necessary to develop a 
more elaborate price accounting scheme than those previously avail- 
able. As part of that effort, considerable attention was devoted to the 
circumstances under which one could infer the rate of productivity 
change from the difference in the rates of input price and product 
price changes. A correct inference can be made only if the industry is 
in competitive long-run equilibrium. If that condition does not obtain, 
serious errors are likely to result. These difficulties can be avoided if 
productivity growth is measured with equation (7) or (12). 
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Some Evidence on Cross-Sector 
Effects of the Minimum Wage 





George E. Tauchen 
Duke University 


This paper tests Mincer’s minimum-wage model by estimating 
reduced-form wage and employment equations for both the covered 
and uncovered sectors in nine regions of the United States. As 
theory predicts, in regions with comparatively small covered-sector 
demand elasticities, the northern and midwestern regions, the 
uncovered-sector wage increases after a minimum-wage hike; and in 
regions with comparatively large demand elasticities, the southern 
and western regions, the uncovered-sector wage decreases. Because 
of data limitations the uncovered-sector employment effect could 
not be estimated sharply, and so its relationship to the covered-sector 
demand elasticity is weak. 


I. Introduction 


Economic theory predicts that a minimum-wage hike decreases em- 
ployment of low-skill workers in the covered sector. Theory also 
suggests that a minimum-wage hike affects the equilibrium wage and 
the level of employment in the uncovered sector though, as pointed 
out by Mincer (1976), the directions of the uncovered-sector effects 
are ambiguous. On the one hand, if the absolute labor demand 
elasticity is relatively small so that firms eliminate only a few covered 
jobs after the wage hike, then workers find the minimum wage attrac- 
tive enough to justify the risk of unemployment while searching for 
jobs in the covered sector. In this case labor leaves the uncovered 
sector, and the wage thereby increases. On the other hand, if the 
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demand elasticity is relatively large so that the return to search is low, 
then displaced workers enter the uncovered sector, and the wage 
decreases. 

The objective of this paper is to test Mincer’s characterization of the 
effects of a minimum-wage hike on the uncovered sector. In the 
empirical work the covered sector is an aggregation of four low-wage 
nondurable manufacturing industries; the uncovered sector is ag- 
riculture prior to coverage under the Fair Labor Standards Act. 
Time-series data are used to estimate the covered-sector labor de- 
mand elasticity and the uncovered-sector wage and employment ef- 
fects of a minimum-wage hike for nine regions of the United States. 
As theory predicts, in regions with comparatively small demand elas- 
ticities, the northern and midwestern regions, the uncovered-sector 
wage increases after a minimum-wage hike; and in regions with com- 
paratively large demand elasticities, the southern and western re- 
gions, the uncovered-sector wage correspondingly decreases. The 
uncovered-sector employment effect is not as sharply estimated as the 
wage effect, and so its relationship to the demand elasticity is weak. 
This result is explained by limitations of the data. 

There is a potentially serious problem with using regional time- 
series data to test the minimum-wage model. The problem arises 
because it is impossible to get long time series of data on the level of 
employment of persons earning exactly the minimum wage. Al- 
though average hourly earnings in the four low-wage industries are 
among the lowest in the economy, they are still well above the 
minimum wage, indicating that the data series contain contributions 
from high-wage labor. Thus the dependent variables in the re- 
gressions are total employment and the average wage, both aggre- 
gated over the wage distribution. It is conceivable, then, that the 
observed regional variation in the covered-sector demand elasticity is 
a statistical artifact reflecting regional differences in the mix of low- 
and high-wage labor. In other words, the true demand elasticity for 
minimum-wage grade labor could be the same in all regions, while the 
estimated demand elasticity is larger in the South simply because 
there are more low-wage workers there. 

The evidence presented in subsequent sections suggests that the 
spurious aggregation effect is small. Briefly, the argument is as fol- 
lows. Think of low- and high-wage labor as imperfect substitutes in a 
production function with constant returns to scale. Now let the 
minimum wage increase by | percent. There are both scale (output) 
and substitution effects. The scale effect causes employment of both 
factors, and hence total employment, to decline equiproportionately. 
The substitution effect, on the other hand, causes low-wage employ- 
ment to decline and high-wage employment to increase. Intuitively, 
and as is shown in Appendix A, the observed proportionate increase 
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in the average wage (AW/W) is no less than the share (51) of low-wage 
labor in total labor costs; if the substitution effect is zero, then AW/W 
equals s,. The regional estimates of AW/W from the regressions turn 
out to be very close to regional estimates of s, derived from indepen- 
dent wage distribution data. This can happen only if the major impact 
of a minimum-wage hike is the scale effect and not the substitution 
effect. Since the scale effect dominates, the observed proportionate 
decline in total employment must be close to the proportionate de- 
cline in low-wage employment; that is, the aggregation bias is small 
in the covered sector. 

The importance of the scale effect together with the cost-share data 
can explain why there are regional differences in the demand elastic- 
ity. The size of the scale effect depends upon the increase in the 
marginal cost of production caused by a hike in the minimum wage. 
Since s, is 0.14 in the North and 0.32 in the South, the proportionate 
increase in marginal cost is larger in the South, and so output and 
total employment contract proportionately more there. This is con- 
sistent with the idea that one purpose of the national minimum wage 
is to reduce the cost advantage of firms located in low-wage regions of 
the country. It is also consistent with Silberman and Durden’s (1976, 
p. 325) finding that Southern congressmen are less likely than other 
congressmen to vote in favor of a minimum-wage hike. 

The remainder of this paper is organized as follows. Section II 
describes a simple two-sector labor market model which is used to 
guide the empirical work. Sections III and IV report the empirical 
findings for the covered and uncovered sectors. Section V uses the 
theoretical model to interpret the results. Section VI contains the 
concluding remarks. 


II. Two-Sector Model 


The following two-sector model of the low-wage labor market is used 
to illustrate the restrictions being tested and to guide the empirical 
work. The model consists of two labor demand schedules (one for 
each sector), an equilibrium condition connecting the minimum wage 
and the competitively determined uncovered-sector wage, and an 
aggregate labor supply relationship. Formally, the model is: 


labor demand (covered): N° = D°(W/W,, X°), D§<0, (la) 
labor demand (uncovered): N“ = D\W"/W,, X“), D%* <0, (1b) 


rae A? 5NC 
. Es |! eo oe = 
equilibrium condition: W ( 7 4+ SNe jw, OF he 1c) 


labor supply: N°+N“+U=S(W*"P), S'>0, (1d) 
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where N¢%, N“ = employment in the covered (c) and uncovered (u) 
sector; W = minimum wage; W“ = wage in the uncovered sector; W, 
= control wage, W, > W; X°, X" = vectors of exogenous shift vari- 
ables; U = unemployment; 6 = turnover parameter, and P = general 
price level. The labor demand equation (1a) expresses employment in 
the covered sector, N°, as a function of the relative minimum wage, 
W/W,, where W, is an exogenously determined measure of high-skill 
wages, and as a function of exogenous shift variables, X°. Similarly, 
equation (1b) expresses employment in the uncovered sector as a 
function of the uncovered-sector wage relative to the high-skill wage, 
W«/W,, and as a function of exogenous variables, X“. Equation (1c) is 
from Mincer (1976). In equilibrium the uncovered-sector wage must 
equal the expected return from search in the covered sector (equal to 
the probability of getting a job times minimum wage). The parameter 
8 in (Ic) is the proportion of covered-sector employees who each 
period must enter an employment lottery with the unemployed. Fi- 
nally, equation (1d) relates the total labor force to the real wage in the 
uncovered sector. 

The four endogenous variables in the system (lyare NG SN2 SWAP ana 
U; the exogenous variables are W, W2, P, X°, and X“. Note first that 
(la) is the reduced-form expression for N°. Now write the reduced 
form for the other three endogenous variables as 


NS INE WV gee al (2a) 
We a WAW, Ws, PS Xen) (2b) 
U= UW, WaPo d xX (2c) 


Theory places restrictions on the reduced-form equations. Specifi- 
cally, Mincer shows 


aw = Om as = ). (3a) 
ON: ee < 
“ow = (Omasmece = 5, (3b) 


where é¢ is the absolute elasticity of labor demand in the covered 
sector (the absolute elasticity of [la] with respect to the minimum 
wage). The unemployment effect is positive, 

aU 


We (4) 


and the sign of the labor force effect agrees with the sign of the 
uncovered-sector wage effect, 
aN* ON" 0U 
fe ee eee 
aw ow 4 aw = 





O as e 36. (5) 
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Some of these results have been previously investigated empirically. 
There are, of course, many studies of the disemployment effect (1a) 
and of the unemployment effect (4). The Mincer study measures the 
labor force effect (5) for teenagers and young adults. After finding a 
negative labor force effect, Mincer infers e° > 8 and uses (3) to 
conclude that OW“/aW < 0, that is, that, asa group, low-wage workers’ 
prospects decline after a minimum-wage hike. Gardner’s (1972) ear- 
lier work with aggregate U.S. agricultural data is consistent with 
Mincer’s findings, though Gardner’s estimates lack statistical signifi- 
cance at conventional levels. The purpose of this study is to use 
regional variation in the demand elasticity e¢ to test the characteriza- 
tion (3a) and (3b). If there is small regional variation in the parameter 
5, then regions with relatively small values of e¢ should show OW“/@W 
> 0,0N"/8W < 0, and vice versa for regions with relatively large values 
of 2¢. 


Ill. The Covered Sector 


The first step in the empirical work is to estimate the demand equa- 
tion (la) to get regional estimates of the covered-sector demand 
elasticity, e°.' The data are annual observations on employment 
(1949-74) and average hourly earnings (1958-74) in an aggregation 
of four low-wage nondurable manufacturing industries: tobacco (SIC 
21), textiles (SIC 22), apparel (SIC 23), and leather (SIC 31).? As 
mentioned in the Introduction, there is a problem because the em- 
ployment variable includes workers for whom the legal minimum 
wage is economically ineffective. The effects of this skill-group aggre- 
gation can be characterized. Appendix A shows that regardless of the 
substitution pattern within aggregate labor, the observed (absolute) 
disemployment elasticity underestimates the sought-after demand 


1 The states within each region are: New England: Vt., Mass., N.H., Conn., R.L., 
Maine; Middle Atlantic: N.Y., Pa., N.J.; East North Central: Ohio, Wis., Ill., Ind., 
Mich.; West North Central: Mo., lowa, N.Dak., S.Dak., Nebr., Kans., Minn.; South 
Atlantic: Va., Del., Ga., Md., W.Va., N.C., S.C., Fla.; East South Central: Ky., Tenn., 
Miss., Ala.; West South Central: Ark., La., Tex., Okla.; Mountain: Nev., Wyo., Mont., 
N.Mex., Utah, Idaho, Colo., Ariz.; Pacific: Calif., Oreg., Wash. 

2 Two-digit manufacturing industries were defined as low-wage industries if U.S. 
average hourly earnings in 1960 were no more than $1.70; the legal minimum was 
upped from $1.00 to $1.15 in September 1961. Only the four indicated nondurable 
manufacturing industries met the criterion. No other two-digit nondurable manufac- 
turing industry had an average wage below $2.00 in 1960; the lowest average wage in 
1960 of two-digit durable industries was $1.88 in furniture (SIC 25). Annual observa- 
tions on state employment and average hourly earnings were collected from U.S. 
Bureau of Labor Statistics (1977). In the case of missing values, the data from nearby 
larger states were spliced into the shorter series. This allowed development of usable 
state employment series from 1949 forward and wage series from 1958 forward. The 
series were aggregated over the four industries and over contiguous states to nine 


regions. 
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TABLE! 


CovERED-SECTOR EMPLOYMENT AND WAGE EQUATIONS: 
SIGNIFICANCE TESTS# 

















DEPENDENT VARIABLE: DEPENDENT VARIABLE: 
n° (1949-74) w° (1958-74) 
SIGNIFICANCE TEST SIGNIFICANCE TEST 
JBN) F (3,8) 
w We w We 
[-1 to +1] [-lto +1] {[-1 to +1] {[-—lto +1] 

United States eK ek K * kK 
New England en 7K oth sok 
Middle Atlantic eo eK —e alas 
East North Central sere 7K wt kok 
West North Central * KK eK KEKK 
South Atlantic kK EK 4k kkk 
East South Central kok eK sek eRKK 
West South Central ok SK eK kK 
Mountain sek ek sok N.A» N.A. 
Pacific kK ek keok a SSeKK 





Nore.—Dependent variables: n° = log of total employment, four low-wage nondurable manufacturing industries, 
by region as indicated; w° = log of average hourly earnings, four low-wage nondurable manufacturing industries, by 
region as indicated. Independent variables: w = log of minimum wage; w, = log of average hourly earnings, all 
manufacturing, excluding average hourly earnings in the four low-wage industries; z = log of the FRB index of 
industrial production, all manufacturing. Lags are in brackets; negative lags are leads (future terms). 

@ Test that indicated group of distributed lag coefficients is zero. 

> No wage data are available for the Mountain region. 

* 15%. 

** 10%. 

RK 5%. 

RAK 1%. 


elasticity. In addition, the observed elasticity of the covered-sector 
wage with respect to the minimum wage overestimates the share of 
minimum-wage grade labor in total labor costs. Below, evidence will 
be presented that e° is only slightly underestimated. 

To focus attention on economics the less important technical details 
of the regressions are put in Appendix B. The results in tables | and 2 
are from distributed lag regressions of the logs of covered-sector 
employment and average hourly earnings, n° and w‘, on the log of the 
minimum wage, w, the control wage, w2, and the cyclical index, z. The 
control wage is the log of average hourly earnings, all manufacturing, 
net of wages in the four low-wage industries; the cyclical variable is 
the log of the FRB index of industrial production.’ Each regression 


3 w, we, and z are the logarithms of the annual average of monthly values. The level 
W, was computed as W, = (EW — %,E,W,)/(E — X,E;), where E is production worker 
employment and W is production worker average hourly earnings. Unsubscripted 
values refer to all manufacturing; the subscript) indexes the four low-wage industries. 
The basic data were taken from U.S. Bureau of Labor Statistics (1976) and from issues 
of Employment and Earnings (U.S. Bureau of Labor Statistics 1976-77). The industrial 
production index was taken from U.S. Department of Commerce (1976) and issues of 
the Survey of Current Business (U.S. Department of Commerce 1976-77). 
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included trend and a constant.* The regressions do not control for 
state minimum wages because in this sample state minimums are 
either at or below the federal minimum wage.° 

The F-tests in table 1 indicate that the distributed lag coefficients on 
the minimum wage are significant as a group in nearly all of the 
equations. Now consider the long-run elasticities reported in table 2. 
As theory predicts, the sign of the minimum-wage effect is negative in 
all of the employment equations and positive in all but one wage 
equation. Moreover, the magnitude of the effect displays important 
regional variation. The disemployment effect appears to be negligible 
in the Northeast and upper Midwest, and it seems to be rather 
important through the South, the Southwest, and the Pacific coast. 
Also, the elasticities of employment and the average wage with respect 
to the minimum wage are consistent with one another, in the sense 
that, wherever the effect on the wage is small, so is the effect on 
employment. 

To my knowledge, there are no other comparable published esti- 
mates of the disemployment effect by region or state. The estimated 
elasticity of —0.29 for the United States, however, is comparable with 
the findings of many other researchers. It is just outside Gramlich’s 
(1976, p. 431) bounds of —0.05 to —0.25 for published estimates.® 

Comparing these long-run effects to regional data on the wage 
distribution in the covered sector suggests that the major impact of 
the minimum wage is the scale or output effect. To see this, consider 
the 1964 wage distribution displayed in table 3. These data were 


4 Throughout this paper regional dependent variables are regressed on national 
variables. This is due to data limitations, but a strong case can be made for this practice 
even if regional data were available. Truly exogenous variables are driven by broad 
economy-wide forces and should not contain endogenous local fluctuations. For the 
quarterly agricultural regressions of Section IV it was possible to compute regional 
agricultural prices but only on an annual basis. Quarterly models with these prices 
repeated four times gave much lower R? than the reported equations. Evidently, the 
time variation is more important than the regional variation. 

> In the Northeast, state minimum wages tend to be just at the federal wage, and 
outside the Northeast they are usually well below the federal wage. Movements in state 
wage floors tend to parallel changes in the federal wage, and the main effect of state 
minimums is to extend coverage to industries only partly covered by federal legislation: 
retail trade, laundries, services, etc., industries excluded from the sample. In 1970 the 
federal minimum was $1.60, while the simple averages of state minimums by region 
were: New England, $1.60; Middle Atlantic, $1.60; East North Central, $.98; West 
North Central, $.65; South Atlantic, $.77; East South Central, $.18; West South Cen- 
tral, $.84; Mountain, $1.07; and Pacific, $1.50. New York was the only state with a 
minimum wage above $1.60; its minimum was raised from $1.60 to $1.85 effective July 
1, 1970. Washington, D.C., and Alaska had 1970 minimums of $1.80 and $2.00, 
respectively, but were excluded from the sample. Source: U.S. Employment Standards 
Administration (19726, p. 52, table 5). 

6 Many of these studies use the ratio of teenage to adult employment as the depen- 
dent variable, and so they estimate only a pure substitution effect (Welch 1974). This 
study estimates a combined scale and substitution effect. 
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TABLE 3 


WaGE DistTRIBUTION FOR THE AGGREGATION OF Four NONDURABLE 
MANUFACTURING INDusTRIES, March 1964 (%) 

















Straight-Time United North 
Wage ($) Less Than States Northeast South Central West 
1.25 1 1 1 1 1 
1.30 16 ii 22 18 14 
1.35 2s 16 30 23 19 
1.40 29 21 39 31 22 
1.45 36 28 47 30 28 
1.50 41 32 53 42 31 
2.50 91 85 98 94 83 
3.50 98 96 100 100 97 

Total 100 100 100 100 100 
Average hourly 

earnings ($) 1.76 1.90 1.59 ee 1.92 
Under $1.40 shares 

of total: 
Employment (&,) Bo BOA poo) a) a2 
Labor costs (s,) Boe 14 pa 24 aL 





Note.—In September 1963 the basic minimum wage was raised from $1.15 to $1.25; in February 1967 it was 
raised to $1.40. The distributions were computed from table 4 of U.S. Wage and Hour and Public Contracts 
Divisions (1965). Accounting identities and interpolations were used to get complete data for the North Central and 
West regions, and so the distributions for these two regions are less reliable than the other distributions. 


collected in a special survey of firms by the Department of Labor. 
Define low-wage workers to be those within 10 percent of the 
minimum wage of $1.25 (10 percent was about the average percent- 
age increase in the minimum wage in the early 1960s). Then the share 
of low-wage labor in total labor costs is 0.14 in the Northeast and 0.32 
in the South. These cost shares indicate that, if firms held employ- 
ment constant, then a 10 percent minimum-wage hike would drive up 
labor costs by 1.4 and 3.2 percent in these two regions. Furthermore, 
if firms respond to the wage hike by contracting low- and high-wage 
employment equiproportionately, then the average wages in the 
North and South would increase by 1.4 and 3.2 percent. According to 
the regressions in table 2, the average wages actually increase by 1.5 
percent in the North and 3.2 percent in the South. (These figures are 
10 umes the unweighted mean elasticity for the two northern regions 
and three southern regions.) A similar agreement between the cost 
share and the wage elasticity exists in the North Central region and in 
the West. The data appear to say that covered-sector firms respond to 
a minimum-wage hike by taking steps to reduce total labor input and 
that both groups of workers suffer equiproportionate disemploy- 
ment. Furthermore, since the substitution effect appears to be small 
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relative to the scale effect, the estimated disemployment elasticities 
are close to the true demand elasticities for low-wage labor. 


IV. The Uncovered Sector 


The purpose of this section is to estimate the minimum-wage effect on 
uncovered (agricultural) employment and wages. The reduced-form 
equations (2a) and (2b) give the specifications. Specifically, the logs of 
agricultural employment and wages are regressed on the minimum 
wage, the control wage, the general price level, and the shift variables 
appearing in (la) and (1b). Data are available quarterly, and the opera- 
tional definitions are as follows. Employment and wages, n“ and w”, 
are, respectively, the logs of hired farm employment and the hourly 
wage exclusive of room and board. Agricultural economists consider 
this wage to be more reliable than the U.S. Department of Agri- 
culture’s various composite measures. In the list of independent 
variables, the minimum wage variable is cw, the legal minimum multi- 
plied (before logging) by the Moore index of the proportion of 
nonagricultural employees covered by the legislation. Intuitively, the 
impact of a 10 percent minimum-wage hike on the uncovered ag- 
ricultural labor market is much different if 20 versus 90 percent of 
the nonagricultural work force is covered. The control wage w2 and 
the cyclical index z are the same variables as used in Section III. The 
general price level p is the log of the CPI. Finally, the uncovered- 
sector shift variables are the logs of the USDA’s index of prices 
received by farmers, p“, and the index of prices paid by farmers for 
production items g. The index g excludes wages.’ 

The basic sample period (before lags and leads) is 1947:I-1966:1V. 
Truncation at 1966 restricts the estimation to a period in which 
agriculture was effectively uncovered by federal legislation. On Feb- 
ruary 1, 1967, coverage was extended to employees of large farms. 
The Department of Labor estimated that by 1971 about 48 percent of 
the agricultural work force was covered by federal legislation.’ Also, 
after extension of federal coverage to agriculture, several states ex- 
panded their coverage to agriculture. Prior to 1967, state agricultural 
minimums were considered unimportant.® 


7 The agricultural employment and wage series were obtained from various issues of 
Farm Labor (U.S. Department of Agriculture 1951-75). The input and output prices 
indexes were obtained from issues of Agriculture Prices, Annual Summary (U.S. Depart- 
ment of Agriculture 1961-72). 

* For details see U.S. Employment Standards Administration (1972a, pp. 5-9). 

* In 1966 only three states in the sample had a minimum wage applicable to agricul- 
tural workers: California, $1.30; Michigan, $1.15; and Wisconsin, $1.00. The Califor- 
nia and Wisconsin minimums applied only to women and minors. Source: U.S. Wage 
and Hour and Public Contracts Divisions (1966, p. 30, table 12). Michigan and Wiscon- 
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Two conclusions are apparent from the regressions. First, the 
significance tests in table 4 confirm the well-known fact that aggregate 
cyclical forces play a strong role in the determination of agricultural 
labor supply. According to the U.S. long-run elasticities in table 5, a 
10 percent increase in industrial production is associated with an 11.3 
percent decrease in hired farm employment and a 2.9 percent in- 
crease in the hired farm wage. This is important because it indicates 
that there are indeed sizable labor flows between agriculture and the 
covered sector. 

Second, and most important, changes in the minimum wage do 
affect agricultural employment and wages. The F-tests in table 4 show 
that as a group the distributed lag coefficients on the minimum wage 
are significant at 10 percent in five of 10 employment equations and 
in eight of 10 wage equations. Note that in table 5 the long-run U.S. 
employment elasticity of —.18 is insignificant, but the wage elasticity 
of —.08 is significant. The latter measurement indicates that for the 
entire United States a minimum-wage hike tends to depress the 
uncovered-sector wage. This finding is consistent with both Mincer’s 
and Gardner’s results. 

Examination of the regional results in table 5 suggests that there are 
important regional differences in the welfare effects of the minimum 
wage. Notice that the effect on the uncovered-sector wage is sig- 
nificantly positive in the New England, Middle Atlantic, and West 
North Central regions; it is significantly negative in the South Atlantic 
and Pacific regions. Thus, as indicated by (3a), the wage in the un- 
covered sector can indeed rise cr fall after a minimum-wage hike in 
the covered sector. Viewed in isolation, however, the uncovered- 
sector results are insufficient to verify the model. Indeed, if the 
hypothesized economic mechanisms are really at work, then the ab- 
solute disemployment elasticities from Section III should be largest in 
the regions where the uncovered-sector wage falls. 


V. Sectoral Comparisons 


Consider the inequality (3a). If the demand elasticity e° is less than the 
turnover parameter 6, then after a minimum-wage hike the flow of 
labor out of the uncovered sector increases the wage there; if e° is 
greater than 6, then the flow of labor into the uncovered sector 
decreases the wage. Therefore, if there is small regional variation in 


sin are in the East North Central region, and their agricultural minimum wages possibly 
could explain the results for that region. Consider, though, the fact that California 
accounts for 88 percent of agricultural employment in the Pacific region, and it turns 
out that after a hike in the U.S. minimum wage the agricultural wage declines in the 
Pacific region. 
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|.New England 6. East South Central 
2.Middle Atlantic 7. West South Central 
3.East North Central 8. Mountain 

4. West North Central 9. Pacific 

5. South Atlantic 


Fic. 1.—The relationship between the uncovered-sector wage and employment 
effects and the covered-sector disemployment elasticity. 


the parameter 6, then plotting regional observations on the effect of 
the minimum wage on the wage in the uncovered sector against the 
demand elasticity e° should give a downward sloping relationship 
crossing the horizontal axis at 6. By (3b), plotting the effect of the 
minimum wage on employment in the uncovered sector against e¢ 
should give an upward sloping relationship also crossing the hori- 
zontal axis at 6. Panels A and B of figure 1 are the corresponding 
empirical diagrams. The dependent variable in panel A is the ob- 
served long-run elasticity of the uncovered-sector wage with respect 
to the minimum wage (from table 5); the dependent variable in panel 
B is the long-run uncovered-sector employment elasticity (also from 
table 5). The independent variable in each panel is the absolute value 
of the long-run covered-sector disemployment elasticity (from table 
2): 

As predicted by theory, panel A indicates an inverse relationship 
between the effect of the minimum wage on the uncovered-sector 
wage and the demand elasticity. The fitted line’s horizontal intercept 
of 0.32 is an estimate of the critical demand elasticity (5) that deter- 
mines the sign of the wage effect in the uncovered sector. The esti- 
mate 6 = 0.32 says that in the steady state at least 32 percent of 
covered-sector employees compete with the unemployed for em- 
ployment. In contrast, panel B suggests no relationship between the 
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effect of the minimum wage on uncovered employment and the 
demand elasticity. The data points are essentially random deviations 
about the horizontal axis. Note that the F-tests in table 4 indicate that 
the minimum wage affects uncovered employment. The sign and size 
of the long-run effect, however, are measured very imprecisely. 

The skill-group aggregation in the uncovered sector data can ac- 
count for the apparent conflict between theory and panel B of figure 
1. When the low-skill wage changes in the uncovered sector, there are 
scale and substitution effects. As noted before and as proved in 
Appendix A, the larger is the substitution effect relative to the scale 
effect, the smaller is the observed (absolute) employment elasticity 
relative to the true proportionate change in low-wage labor. Evi- 
dently, the elasticity of substitution between low- and high-skill labor 
in agriculture is large enough to make this skill-group aggregation 
suppress the uncovered employment effect to an undetectable level. 
Notice that this is the reverse of the situation in the covered sector. As 
was noted in Section III, in the covered sector the scale effect domi- 
nates the substitution effect; in contrast, in the uncovered sector the 
substitution effect dominates the scale effect. 


VI. Summary and Conclusion 


This paper examines the covariation over regions in the effects of the 
minimum wage on wages and employment in the covered and un- 
covered sectors. Its major empirical finding is that the pattern of 
results is consistent with theory. Specifically, after a minimum-wage 
hike the uncovered-sector wage increases in the five regions with 
relatively small covered-sector demand elasticities: New England, 
Middle Atlantic, East North Central, West North Central, and East 
South Central; it decreases in the four regions with the relatively large 
covered-sector demand elasticities: South Atlantic, West South Cen- 
tral, Mountain, and Pacific. In addition, Mincer’s and Gardner’s ear- 
lier work suggesting that the uncovered-sector wage decreases in the 
entire United States is consistent with these findings, for their results 
are dominated by the four large regions (in terms of low-wage em- 
ployment) in which the uncovered-sector wage falls. This paper’s 
results have some bearing on the issue of welfare effects of the 
minimum wage. It seems fair to conclude that the minimum wage 
increases the well-being of all low-wage workers in the New England, 
Middle Atlantic, East North Central, West North Central, and East 
South Central regions. In the remaining regions, however, the 
minimum wage increases the well-being of only those workers having 
covered-sector employment. 
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Appendix A 


The purpose of this Appendix is to characterize the effects of using total 
employment and the average wage as dependent variables in the regressions. 
Suppose low-skill class 1 labor (N,) and high-skill class 2 labor (Nz) are 
imperfect substitutes in a production function with constant returns to scale. 
Let the class 1 wage (W,) increase. Then 


N, = (scale effect — 52012)W,, 
N, = (scale effect + 51012), 


where ~ denotes proportionate change, s, and s, are the factors’ shares in 
expenditures, and g,, > 0 is the absolute Hicks-Allen partial elasticity of 
substitution. Let k, and k, denote the factors’ shares in total employment; then 
the proportionate change in total employment is 


N =k,N, + kN, 
= [scale effect + (kp — 52)0121W4, 
which is less negative than N,. The change in the average wage is 
W = (Wy + haW2) 
= 5,W, + () — hy), — No) 
=I — (1 — Aon, 


which exceeds s,W, because s, < k, (class 1 labor is the low-wage factor). 
Clearly the characterization applies equally well to either sector. (The above 
arguments were found by an anonymous referee.) 


Appendix B 


This Appendix discusses the technical econometric issues associated with the 
empirical work. 


Specification 


There are three important points concerning the covered-sector equations. 
First, adjustment costs (hiring and separation costs) induce a dependence of 
factor demand on lagged and expected future factor prices (Sargent 1978). 
This motivates using distributed lag methods and including a lead value of 
the minimum wage, for its movements are known well in advance. Second, 
minor distortions due to temporal aggregation (the unavoidable use of annual 
instead of monthly data) are apparently the cause of a significant lead 
coefficient on the control wage. This was verified by using Geweke’s (1978) 
results. The effects of the temporal aggregation are minor because the ex- 
planatory variables are annual averages and the long-run effects are there- 
fore estimated consistently (see Geweke’s theorem 2). Third, the temporal 
aggregation and the forecasting of factor prices are good reasons to expect 
the homogeneity of degree zero of (la) in nominal variables to break down in 
practice. Consequently, w and w, are entered separately in (la) and the 
homogeneity constraint tested (and rejected). The coefficients have a “real” 
interpretation, for the entire history of each nominal variable is held constant. 

The same comments apply to the uncovered sector regressions, though 
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with quarterly data the temporal aggregation problems are absent. To purge 
the series of seasonal effects, each regression included seasonal dummies and 
the products of seasonals with trend. 


Serial Correlation 


In the annual covered-sector regressions a first-order Cochrane-Orcutt cor- 
rection was used; in the quarterly uncovered-sector regressions ad hoc 
prefilters were used together with fifth-order Cochrane-Orcutt. 


Exogeneity 


Table B1 reports the outcome of batteries of Sims’s (1972, 1977) exogeneity 
test. The test consists of the appropriate F- or t-test on lead coefficients. The 
paucity of rejections indicates no serious problems with a lack of exogeneity in 
either set of results. Since lead values of w and wy, enter the covered-sector 
equations for other reasons, the test’s power is obviously low in that set of 
results. 
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Taxes, Inflation, and the Durability of Capital 





Andrew B. Abel 
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The choice of capital durability is affected by the rate of inflation 
because nominal depreciation deductions are based on historical cost 
rather than replacement cost. This paper analyzes a model with 
competitive firms and intertemporally optimizing consumers and 
demonstrates that an increase in the rate of inflation will lead to an 
increase in the durability of capital, if the ratio of the nominal 
discount rate to the rate of depreciation exceeds a critical value. 
However, if this ratio is less than the critical value, this effect is 
reversed. The effect of costs of adjustment on the critical value is 
analyzed. 


I. Introduction 


Since depreciation deductions for tax purposes are based on historical 
cost rather than replacement cost, inflation can affect the investment 
and capital durability decisions of firms. This problem was recently 
analyzed by Auerbach (1979), using a general equilibrium model in 
which investment and durability are determined by optimizing firms 
and consumption is determined according to a proportional savings 
function. To analyze the effect of inflation on the choice of asset 
durability, Auerbach simulates his model, using a Cobb-Douglas 
technology with reasonable parameter values.' His simulation results 
indicate that higher inflation leads firms to choose more durable 
capital, although we are left with a caveat about the robustness of the 
results. 


This paper was written while I was an assistant professor at the University of Chicago 
Department of Economics. I thank Alan Auerbach, Dennis Carlton, Stanley Fischer, 
Robert LaLonde, Frederic Mishkin, and Michael Mussa for comments on earlier drafts 
of this paper. 

‘ Auerbach also assumes values for the share of government expenditure in net 
output, the fraction of disposable income saved, and the corporate tax rate. 
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By modifying Auerbach’s model to make consumption decisions 
based on intertemporal utility maximization rather than on a propor- 
tional saving function, we can derive analytically the effect of inflation 
on durability without having to specify the form of the production 
function. The direction of the effect of inflation on durability is 
determined by whether the ratio of the nominal discount rate to the 
depreciation rate exceeds a critical value which depends solely on the 
corporate tax rate and the rate of the investment tax credit. If the ra- 
tio of the nominal discount rate to the depreciation rate exceeds the 
critical value—as is the case in Auerbach’s simulations?—then an in- 
crease in the rate of inflation leads to an increase in durability. How- 
ever, if the ratio of the nominal discount rate to the depreciation rate 
is less than the critical value, then an increase in the rate of inflation 
leads to less durable capital. This reversal of Auerbach’s finding is 
more likely to apply to durable equipment rather than to structures 
since equipment has a higher depreciation rate than structures. 

A second modification of Auerbach’s model is the incorporation of 
adjustment costs into the model. The presence of convex costs of 
adjustment removes the indeterminacy of the individual firm’s rate 
of investment in the Auerbach model. It should be noted that one 
complication of introducing adjustment costs is that the critical value 
of the ratio of the nominal discount rate to the depreciation rate is no 
longer a function of only tax parameters. The critical value of this 
ratio will, in general, depend on technology, although we can place 
bounds on this critical value. Using these bounds on the critical value 
of the ratio of the nominal discount rate to the depreciation rate, we 
can still derive fairly strong results about the effect of inflation on 
durability based only on this ratio and on the tax parameters. In 
general, if the nominal discount rate exceeds the rate of depreciation, 
an increase in the rate of inflation reduces the rate of depreciation. If 
the nominal discount rate is “enough” less than the depreciation rate, 
where enough depends only on tax parameters, an increase in the 
rate of inflation increases the rate of depreciation. In the region 
between the upper and lower bounds on the critical value, the critical 
value depends on technology. Note, however, that this complication 
arises from the introduction of adjustment costs. In the absence of 
adjustment costs, the critical value of the ratio of the nominal discount 
rate to the depreciation rate depends only on tax parameters. 


* For a more precise discussion of Auerbach’s simulation results, see Section III. 

3 This statement implicitly assumes that the investment tax credit rate is the same for 
structures and equipment, so that the critical value is the same for both types of capital. 
However, under the current U.S. tax laws, the investment tax credit is applicable to 
equipment but not to structures. Therefore, as will become clear in Section III, the 
critical value is smaller for equipment than for structures. This effect may weaken or 


reverse the statement in the text. 
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In Section II we develop a model of a competitive firm and analyze 
the investment and durability decisions. The model of the firm is 
incorporated into a general equilibrium model in Section III, where 
we analyze the effect of changes in the rate of inflation. Section IV 
presents concluding remarks. 


II. The Model of the Firm 


Since the model of the firm in this paper differs from Auerbach’s 
model only by the inclusion of adjustment costs and the investment 
tax credit, the description of the model will be brief; further descrip- 
tion may be found in Auerbach. 

We assume that the firm is a price taker in its output market and in 
the labor market. The price of output p and the nominal wage rate w 
both inflate at rate 7, so that the real wage is constant over time. Let or 
be the (constant) rate of exponential decay of capital goods installed at 
time ¢. Ex ante, 5; is a decision variable of the firm. We assume that the 
instantaneous flow of capital services from a unit of capital is an 
increasing concave function of 6, A(6), with A’! >:0A" <2 0! 

Investment in physical capital consists of purchasing a unit of the 
homogeneous output and installing it in place at some cost which is an 
increasing convex function of the rate of investment. Let J; be nominal 
investment at time ¢, so that J,/p, is the number of units of output 
bought and installed as physical capital. We assume that the real cost 
of installation is an increasing convex function of real investment, so 
that the nominal cost of installation at time ¢ is p,c(J;/p,), with c(0) = 
c'(0) = 0, and c’ > 0, c” > O for I,/p; > 0.4 We assume that these 
installation costs can be expensed for tax purposes. Note that the 
assumption that the purchase of uninstalled capital goods is amor- 
tized over time while the installation cost is immediately expensed is 
similar to Auerbach’s case in which a fraction e of the cost of invest- 
ment is expensed.? 

Capital services at time s are 


KS, = [Aen tiipne toned (1) 


since (I,/p,e°“* is the amount of capital installed at time ¢ which 
remains at times and A (8,) is the flow of capital services per unit of this 


4 Adjustment costs have been incorporated into models of investment by Eisner and 
Strotz (1963), Lucas (1967), Gould (1968), and Treadway (1969). Abel and Blanchard 
(1980) distinguish the cost of purchasing uninstalled capital from the cost of installing 
capital, as is done in the text. 

> There is a slight difference, however, in that the fraction of the total cost which is 
expensed in this model is endogenous, whereas the fraction e is exogenous in Auer- 
bach’s model. 
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capital. Gross output at time s is given by Y§ = H(KS,, L,), where H is 
first-degree homogeneous in KS and L. We also assume A xs, A,, Axsp 
> 0 and Hxs xs, Hy, < 0. 

Taxable corporate profits, which are revenues less wages, installa- 
tion costs, and depreciation allowances, are taxed at rate 7,0 <7< 1. 
Let D(x, 5) be the depreciation deduction per dollar of purchase cost 
of capital of age x which decays at rate 5, and let k (0 < k < 1) be the 
investment tax credit rate applied to the purchase of uninstalled 
capital. 

The objective of the firm is to maximize 


V = {a OP Rs erate. ape Usp im (lige (2) 


+ “| LG os Bydole-rds 


with respect to Ls, 6, and /,;, where r is the rate at which equity holders 
discount after-tax nominal flows.® Note that (2) is the same as Auer- 
bach’s equation 3 except for the inclusion of installation costs and the 
investment tax credit. For simplicity of notation, let 


z(6) = J, ve. Oma dx, (3) 


be the present value of depreciation deductions, and let p = r — 7 be 
the after-tax real discount rate. Differentiating V with respect to 6, 1), 
and L, and setting these derivatives equal to zero, we obtain:” 


(11) [46 ~ (6 — DAB) as(KS pL pes = nee 


0 
5, [1 —k — 72(8,)], 


(1 — 7)c’ Ue! pe) = (1 — 7)A(8)) J A xs{KS 5,L se PPO ds — (4b) 


Pha heen 2(O,), 
A (KS ,,L,) = wy/pr. (4c) 


These equations may be solved recursively.* At any point in time, ¢, 
KS, is fixed and the firm chooses L; to satisfy (4c). Note that, since the 


6 We assume that firms are financed entirely by equity, so that interest payments are 
not included in the expression for cash flow. 

7 The first-order conditions are identical to Auerbach’s eq. 4, except that (1 — 
t)c'(I;/p.) in (4b) and & in (4a) and (4b) are zero in Auerbach’s framework. 

8 If the installation costs were amortized over time rather than immediately ex- 
pensed, then the system would be only block recursive. Eq. (4c) could be solved first, but 
the analogues of (4a) and (4b) would have to be solved simultaneously. 
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real wage, w;/p;, is assumed to be constant over time, H,(KS;,L;) is 
constant over time. Since H(KS;,L;) is linearly homogeneous, the con- 
stancy of H,(KS,,L,) implies that Hx5(KS ;,L;) is constant over time. Let 
¢ be the constant value of Hxs(KS;,L;) and define (6; @) as 


Nona — Phew (5) 


Note that [(1 — 7)4(5,)6]/(p + 6,) is the present value of the stream of 
after-tax marginal products accruing to the undepreciated portion 
of a unit of capital installed at time ¢. In addition, by installing a unit of 
capital at time ¢, the firm reduces the present value of its taxes by k + 
7z(5,). Therefore, \(5; @) is the excess of the present value of the 
after-tax cash flow accruing to the marginal unit of installed capital 
over the cost of an uninstalled unit of capital. 

Note that the first-order conditions given in (4a) and (4b) may be 
expressed as? 


PP ee (Laren ty nearer) Oz(51)_ _ 
N65 4) = SS SE |") aan 0” nica) 


(1 — r)e'Telps) = A(8e ). (6b) 


To summarize, the firm may use the following sequential decision 
process. Given a real wage rate, the firm chooses a KS/L ratio to equate 
H, with the exogenous real wage. This KS/L ratio uniquely deter- 
mines @, the marginal product of capital services. Given @, the excess 
of the shadow value of installed capital over the net cost of uninstalled 
capital is a function, \(5;; #), of 5. According to (6a), the firm chooses 
6; to maximize )(6;; d).'° Given the resulting maximized value of \(8;; 
g), the firm will choose the real rate of investment to equate the 
marginal installation cost with (8,; @), as in (6b). This solution proce- 
dure applies at all points in time, even when the capital stock has not 
yet reached its steady-state value. Since the real wage rate is assumed 
constant, the marginal product of capital services, @, is also constant. 
Therefore, the A-maximizing value of is constant over time, and the 


° If we impose the constraint that gross investment is nonnegative, the first-order 
conditions with respect to J are: (a) (1 — 7)c’ (L/P) = X(8; b); and (6) (L,/p) (1. — r)c’ (,/p,) — 
A(6s; b)] = 0. If L:/p; > 0, then (6) implies eq. (6b). 

 Ifz = 8/(p + 8), as will be the case later in this note, then (6; d) = (kK - 1) + (1 - 
7)A(6)$(p + 8)] + [rd/(p + 8)], A'(8; 6) = [1p + 8)P2KM(1 — 7)[A'(8\(p + 8) — A(8)] + 
Tp}, and d"(8; d) = [—2/(p + 5)*1{(1 — 7)fL4'(8)(p + 8) — A(S)] + tp} + [1 — 7)6A4(8)/(0 
+ 6)]. Therefore, when \'(8; 4) equals zero, the term in braces in the expression for 
r"(8; db) is Zero and \"(5; @) = [(1 — 7)6A"(5)/(p + 8)] < 0. Hence, setting \'(5; @) equal to 
zero maximizes (5; ) with respect to 8. Note that later, when we examine the case in 
which z = [6/(p + aw + 8)], a similar line of reasoning will establish that \”(8; @) < 0 when 
’(6; ) = 0, as long as 7 is not too large. 
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real rate of investment is also constant." Given the constant value of 
real investment, say (//p)*, the steady-state capital stock is (1/5)(I/p)*, 
and the steady-state flow of capital services is [4 (5)/5](I/p)*. 


Ilf. A General Equilibrium Model of Investment and Durability 


In this section, we examine the effects of changes in the rate of 
inflation within the context of a general equilibrium model. Rather 
than posit a proportional savings function, as in Auerbach, we assume 
that consumption is determined as the outcome of infinite-horizon 
intertemporal utility maximization by identical households. As noted 
by Auerbach, the after-tax discount rate p will be constant in such a 
model. 

To keep the analysis simple but without any essential loss of 
generality, assume that the number of households is constant, and 
each household owns one firm and inelastically supplies L units of 
labor to its firm. Each household selects c; and m, to maximize 


in U(cz, me Pat (7) 


subject to c, = y; + (w,/p)L — (M,/p:), where c; is real consumption 
at time ¢t, m; is real balances at time ¢, M; is the growth in nom- 
inal money balances, and y, = (1 — 7)LH(KS,,L) — (w,/p,)L = chip yl= 
iS khy\inz t[llhper DD (t — s,6,)ds is the after-tax real cash flow of 
the firm. The maximization problem is based on the assumption that 
each household receives the net cash flow of the firm in addition to 
wage income and uses its total income either to consume or to add to 
its money balances. We also assume that the government uses tax 
revenue and the issuance of new money to purchase current 
output—which does not affect the utility function U(c;, m).” 

Solving the maximization problem (7), we can show that, in the 
steady state, equations (6a) and (6b) continue to hold. However, in this 
general equilibrium model with labor supply constant, ¢, the margi- 
nal product of capital services, is a decreasing function of the steady- 
state level of capital services [4(6)/d](//p). Therefore, for the steady- 
state values of the variables of interest, ¢, 5, and J/p, we have the 
following system of equations: 


™ Gould (1968) showed that, for a price-taking firm with constant returns to scale, the 
rate of investment is constant if the real wage rate is constant. 

® Abel and Blanchard (1980) present a more complete discussion of a general 
equilibrium model with taxes and adjustment costs. However, in that model the depre- 
ciation rate is fixed exogenously, and money balances do not appear. 
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_ s|4 rip: g' <0. (8a) 
d(8, b) = 0, (8b) 
(8, 6) — (1 — re" (I/p) = 0. (8c) 


Note that the system (8a)—(8c) could also represent the behavior of a 
single profit-maximizing firm with constant returns to scale with 
either some monopoly power in its output market or some monop- 
sony power in the labor market. 

To examine the effect of a change in the rate of inflation, we 
linearize the system (8a)—(8c) around its solution to obtain 


peel tn ee A(5 vie 

1 -g' “P [ar@) - 40) - 2. AOD || ad | | 0 

a My 0 dd |\=|—-\, (9) 
Ne 0 > (lees ick d([/p)| |~Ax 





where Ay = [(1 — 7)4(8)/(p + 8)] > 0, and A4 = (1 — 7)/(p + 8){A'(6) — 
[A (8)/(p + 8)]} < 0, and A < 0 because of the second-order condition 
for the optimal 5.’* Let A be the determinant of the square matrix in 
(9), so that 

I/p 


KR Nae Ag Oe i TAGs 40) lla Be On ae 


A 





© Neo 
(10) 


Note that A > 0 regardless of whether c” is positive or zero. 
Given the linearized system (9), Cramer’s rule can be used to de- 
termine the effect on 6 of changes in the rate of inflation 7, 


dd = ale 4, Nee ee A (9) 
P= [1 = )ehy = gS By (11) 
where B, = —AgAq + AA. In analyzing (11), we will also consider the 


following two special cases: (a) the absence of adjustment costs, so that 
c" = 0; and (6) the polar opposite case of extremely convex adjustment 


costs, so that c” = o. In the case without adjustment costs, which 
corresponds most closely to Auerbach’s model, it can be shown that 
dd ea ier A(8) eS 
aS Seu ae ae (12) 


where the coefficient on B, is positive. In the polar opposite case, it 
can be shown that 


13 Note that the element in the third row, second column of the square matrix in (9) is 
Xs, which is zero according to (8b). 
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sign oa =signdA, (c" =). (13) 

In order to analyze the expressions for d8/d7z, we must explicitly 

model the effect of inflation on the shadow price of installed capital. 

Following Auerbach, we assume that depreciation deductions reflect 
physical decay of capital but are based on historical cost, so that!* 


ee 
p 


eT Ees Oey 


Using this specification of depreciation deductions, we can easily show 
that 


_ (1 —7)A(8)d TO hs, 
d See Sra Ie (15a) 
eee ara 2 (0) t(p + 77) 
: p+6 a2) Bre Sera aaa oe 


Now we must calculate \,, \,, and B,,. Differentiating A with respect 
to 7, we obtain 


TO 

Az = ————~,, < 0. 16 

ap tart 6) Ge) 

Note that \, is negative because an increase in the rate of inflation 

reduces the present value of real depreciation deductions and hence 

reduces the shadow price of installed capital. Next, we differentiate \’ 
with respect to 7 to obtain 


bree gg Ore pm) = 0 ass2ptq. (17) 
Thus the sign of A, depends on whether the rate of depreciation 
exceeds the nominal discount rate. As shown in figure 1, \,, is negative 
in region / and is positive in regions JJ and III. 

Recall that B, = —jd, + AoA}, that Ay is positive, and that A, and Ag 
are negative. Thus in region /, where 6 < p + m7 and 4; is negative, B,, 
is negative. To determine the sign of B, when 6 > p + 7 requires 
more calculation. Using the expressions for Ag, Aj, Ax, and A,, we can 
show that 


ee ay ae | ]+43254 (18) 


A(s) ptod| ptatd) 








ie pt+6 (p+7+ 8) 


14 If depreciation deductions are based on replacement cost rather than historical 
cost, then z(5) = 8/(p + 5). In this case, changes in the rate of inflation have no effect on 


the firm’s decision problem. 
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ot (p + 7 = 6) 





Fic. 1 


It will now be useful to define g, the net cost of installed capital, 
ignoring the depreciation allowance, as 


Gf lie Keka lie TC (19) 


Using this expression for q, we obtain” 


Mer es 7 TA (8) ioc Seema eae, Ws a AISA 
ie RE aE EOC A (o + 7 








(20) 
Note that B, has the same sign as the term in brackets, so that 
B,20ash7s See (21) 


Thus the sign of B,, depends on whether (p + 77)/5 exceeds the critical 
value V1 — (z/q). Note that the critical value V 1 — (7/q) is less than or 


* Rewrite eqq. (6a) and (6b) as 


(SON pris. 20) | oe =t(p em) ' 
p+6 aXe) al (p + mw + 8)?’ ay 
and 
(1 — rh oe 78 ; 
p+6 eG ptamt+s6- oe 
Dividing (6a’) by (6b’), we obtain 
40) ee Le =1(p + 7) 





Aids) p+8 gqpt+m7+52?—T8p+7+ 8)’ 
which may be substituted into (18). 
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TABLE 1 


‘THE EFFECT OF INFLATION ON DURABILITY 




















SIGN OF a 
di 
I II II 
p+m.) Cry PAT ptT T 
Rectan han 1 aa ee ras aii Laer =a 
General c” = 0 = ? a 
No adjustment costs = — sf 
cl =o Wee ae ae 





equal to one, so that if p + 7 > 6, as in region /, then (p + 7)/5 exceeds 
the critical value, and B, is negative. Also note that, since q = 1 — k, 
with equality holding in the absence of adjustment costs, the critical 
value must be greater than or equal to V1 — [7/(1 — k)]. Therefore, if 
(9 + 7)/6 < V1 — [7/(1 — &)] as in region III, B,, is positive. 

Recall from (11) that, in the general case, if 4}, and B,, have the same 
sign, then dd/dz also has that sign. Thus, as summarized in table 1, 
dé/dm is negative in region J and is positive in region J/J. In region 
II, X; is positive, but the sign of B, is indeterminate without fur- 
ther assumptions. If we assume that c” = ™, then the sign of dd/dia 
is the same as the sign of A, and hence is positive. However, if we 
assume that there are no adjustment costs, then the critical value is 
V1 —[7/(1 — k)], and B, is negative. Hence, in the absence of adjust- 
ment costs, dd/d7 is negative in region II. 

To summarize, if the nominal discount rate exceeds the rate of 
depreciation, then an increase in the rate of inflation reduces the 
optimal 6 and hence leads to more durable capital. On the other hand, 
if the nominal discount rate is less than V1 — [7/(1 — k)] times the 
depreciation rate, an increase in the rate of inflation leads to less 
durable capital. In the case in which there are no adjustment costs and 
the investment tax credit is zero (as in Auerbach), an increase in the 
rate of inflation increases the durability of capital if p + 77> V1—76 
but reduces the durability of capital if p + m< V1 —76. 

It is interesting to compare the results for the case with zero ad- 
justment cost and zero investment tax credit with the simulation 
results in Auerbach. Note that these two models are not identical in 
that they treat consumption decisions,and money-holding decisions 
differently. The essential difference is that, in the optimizing model 
of this paper, p is held constant, whereas p is a variable in the 
Auerbach model. In Auerbach’s table 1, the nominal discount rate 
exceeds the depreciation rate in every row except the first. Consistent 
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with the results derived above, an increase in the rate of inflation 
increases the durability of capital in Auerbach’s simulations. 

In the first row of Auerbach’s table 1, the nominal discount rate 
falls short of V1 — 7 8, and the results derived above would indicate 
that d5/dzr is positive. However, 6 decreases when 77 is increased from 
row | to row 2. There are two possible explanations for this apparent 
contradiction. First, although the ratio (9 + 7)/6 begins below the 
critical value V1 — [7/(1 — k)] in row 1, it ends up above this critical 
value in row 2. Hence, in raising the rate of inflation from zero to 5 
percent, d6/d7 changes sign, and it may be that the negative values of 
d8/dm dominate the positive values over this range. Second, Auer- 
bach’s table 1 indicates a drop in the real discount rate p when going 
from 7 = 0 to m = 5 percent. The results derived in this paper have 
assumed that p is constant. However, as shown below, a decrease in p 
will tend to cause a decrease in 6, consistent with the results in 
Auerbach’s table 1. 

To calculate the effect of p on the depreciation rate 6, simply 
replace \, by Ap and dj by Ay in (11). Differentiating A with respect to 
p, we obtain 


_ -(1- 1A) _ 75 
do (pit o)- (9 + m + 8)? So oe 


Note that A, is negative since an increase in the real discount rate 
reduces the present value of future rentals (including depreciation 
deductions) to capital. 

Differentiating \’ with respect to p, we obtain 


Pee oriole RAGE Peel ets ae 

hie = 

Foc pueda see eg eee 
(23) 


Evaluating this derivative where \’ = 0, we substitute 7(p + 7)/(p + 7 
+ 6) for [—(1 — 7)b/(p + 8)]{A'(8) — [A(6)/(p + 8)]} to obtain 


ye eae OE (5+ 22% } 


Be a ae oe 


which is positive for 7 > 0. To calculate B,, substitute \, and Aj into B, 
= —\gAdp + AA, and use the substitution mentioned in footnote 15 to 
obtain ~ 





Under the reasonable assumption that 7 < 1 — k, it is the case that g — 
7 > 0 and, hence, B, > 0. Therefore, since \; > 0 and B, > 0, we 
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determine from the appropriately modified version of (11) that dd/dp 
> 0. Therefore, as mentioned above, a decrease in p tends to decrease 
the optimal value of 6. 

We have shown that the direction of the effect of inflation on 
durability depends on relative magnitudes of the nominal discount 
rate and the depreciation rate. We now examine the effect of inflation 
on the steady-state flow of capital services by examining the effect on 
o, the marginal product of capital services. Using Cramer’s rule, we 
obtain 

oa iS ~ (1 = r)c"g’ a A'(8) — “| Nema AO Nhe - (26) 
Note that the coefficient of \4, is positive and that the coefficient of A, 
is negative. Furthermore, recall that d, is negative. Therefore, dd/da 
is positive if either c” = 0 or Aj, is positive. Since ¢ is a decreasing 
function of the flow of capital services, it is clear that, in the absence of 
adjustment costs, an increase in the rate of inflation reduces the flow 
of capital services, even though the durability of capital may increase 
or decrease.'® 


IV. Concluding Remarks 


By replacing Auerbach’s proportional consumption function with 
consumption behavior determined by intertemporally optimizing 
consumers, we can derive strong results about the direction of the 
effect of inflation on durability without appealing to a simulation 
analysis. If, in the absence of installation costs, (p + 7r)/6 (the ratio of 
the nominal discount rate to the rate of depreciation) exceeds a 
certain critical value, then d6/dz < 0, so that an increase in the rate of 
inflation leads to more durable capital; if this ratio is less than the 
critical value, then d8/dz > 0. As explained in Section III, these results 
are broadly consistent with Auerbach’s simulation results in which 
dé/dm < 0 since Auerbach’s simulations are confined to the region in 
which (p + 7r)/6 exceeds the critical value. However, for less durable 
capital, the ratio (p + 7)/6 may fall short of its critical value, and the 
model presented here indicates that d6/da would be positive. Regard- 
less of whether an increase in the rate of inflation increases or de- 
creases durability, we find that, in the absence of installation costs, an 
increase in the rate of inflation reduces the steady-state flow of capital 


services. 


16 For the case in which c" = ~, sign dd/dm = sign \1, = sign d6/dz. In this case, the 
steady-state rate of investment is invariant to the rate of inflation. If an increase in 7 
increases (decreases) 5, then the steady-state flow of capital services [4(6)/85](//p) is 
reduced (increased), thereby increasing (decreasing) ¢. 
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Uncertain Lifetime, Consumption, and 
Dissaving in Retirement 





James B. Davies 


University of Western Ontario 


This paper asks whether the continued accumulation, or mild dis- 
saving, observed among the retired can be explained by uncertain 
lifetime. In the absence of annuities, after an initial period in- 
fluenced by borrowing constraints, under constant relative risk aver- 
sion, uncertain lifetime depresses consumption by a proportion 
increasing with age if the elasticity of intertemporal substitution in 
consumption is “small.” Illustrative computations, based on actual 
income and survival data, show that plausible elasticities are suf- 
ficiently small to give this effect. The reduction in consumption is 
large enough to explain much of the lack of decumulation by the 
elderly. 


I. Introduction 


It has frequently been observed that the elderly either continue to 
save in retirement or decumulate much more slowly than would be 
predicted by the crude life-cycle model without bequest motive under 
certainty. Most recently, Mirer (1979) has found that, controlling for 
education, in cross section the mean net worth of American couples 
continues to rise in retirement.’ When secular growth is taken into 


I would like to thank Glenn MacDonald, Nigel Tomes, Geoffrey Carliner, Chris 
Robinson, and John Whalley, as well as the participants in the Labor Workshop at the 
University of Western Ontario and a seminar at McMaster University, for helpful 
comments. The responsibility for any errors or omissions is my own. 

1 Shorrocks (1975) points out that the slope of the age-wealth profile is biased upward 
by the lower mortality of the wealthy. Mirer (1979) argues that controlling for educa- 
tion largely corrects for this. While it is not clear that explicit corrections based on 
éstimates of differential mortality by social class provide more reliable results (Shor- 
rocks 1975, p. 163), it should be noted that Shorrocks obtained fairly rapid decumula- 
tion in retirement for the United Kingdom by using this alternative. 


[Journal of Political Economy, 1981, vol. 89, no. 3] 
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account, this implies an increase, a fortiori, for a cohort. Almost as 
startling are the positive, or only slightly negative, rates of accumula- 
tion found in surveys of saving.” However, few, even among the old, 
say they are saving for bequest.* Recent work suggests that this may be 
because human as well as nonhuman intergenerational transfers are 
possible, and many families are at a corner where only the former are 
desired (see Drazen 1978; Becker and Tomes 1979; Adams, forth- 
coming). Thus a bequest-motive explanation of the slow dissaving of 
the elderly is not as attractive as it might seem. 

This paper shows that the observed slow decumulation of the el- 
derly may to a large extent be explained by uncertain lifetime. It does 
so by developing a model in which the effect of this kind of uncer- 
tainty on consumption by persons without a bequest motive can be 
assessed at different ages. Constant relative risk aversion is assumed 
for simplicity, but an appendix (obtainable from the author) shows 
that the results also hold under constant absolute risk aversion. Non- 
pension annuities are ignored on the argument that a no-bequest 
life-cycle model is consistent with their observed unimportance in 
household portfolios only if available annuities are not sufficiently 
attractive to dominate regular saving instruments for most consum- 
ers. With the most plausible taste parameters and rates of return, in 
this model in the absence of pensions uncertain lifetime will not only 
depress consumption at all ages but will also have an increasingly 
severe proportional impact beyond middle age. Illustrative computa- 
tions, based on actual earnings and survival data, show that this 
outcome is also likely with realistic pensions and that the effects are 
sizable. On conservative assumptions, uncertain lifetime may more 
than halve the mean rate of decumulation among the retired. 

The paper is organized as follows. Section II reviews previous work 


* In three major studies, the following ratios of mean saving to mean net worth for 
those over 65 were found: 











Rate 
Source Country Year (%) 
Lydall (1955, pp. 143, 147) WEKe 1952 = eS) 
Projector and Weiss (1966, p. 110) 
and Projector (1968, p. 215) WES: 1963 +,2 
Statistics Canada (1973a, p. 84; 
19736, p. 140) Canada 1970 +.8 





* Only 4 percent of the respondents to the 1962 Survey of Financial Characteristics in 
the United States cited “providing an estate” as a saving objective (Projector and Weiss 
1966, table A30). The 1964 Brookings Survey of affluent families (incomes above 
$10,000) found only 23 percent (of all ages) who were saving to make a bequest 
(Barlow, Brazer, and Morgan 1966, p. 198). 
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on the impact of uncertain lifetime on consumption. Section III 
presents the model, Section IV the illustrative computations. 


II. Previous Work 


Yaari (1965) studied the impact of uncertain lifetime on the planned 
rate of change of consumption, examining four cases defined by the 
presence or absence of a bequest motive and actuarially fair insur- 
ance. Except in the cases with a bequest motive, insurance takes the 
form of either life-insured net borrowing or annuities. Since under 
actuarial fairness the interest rate on the latter exceeds that on “regu- 
lar” savings, consumers without a bequest motive hold their entire 
wealth in the form of annuities.? 

Omitting a bequest motive, Yaari assumed that a consumer aged ¢, 
with probability P (z t) of surviving to age 7 and a maximum lifetime 
of T, would have the expected utility integral: 


1 
V= i P(r | t)a(r)ULC(7)]dr (1) 


where C(7) is consumption at 7, a@(7) is a subjective discount factor, and 
the rate of time preference is 1 — P (7 | t)a(t). 

Maximizing (1) subject to the appropriate constraints yields the 
following results. In the absence of insurance, since the rate of time 
preference is higher than under certainty [P (7 | ¢) < 1], consumption 
tends to grow more slowly throughout the life cycle. However, ac- 
tuarially fair insurance makes the price of future consumption P (z | t) 
exp [—1(7 — t)], where r is the (constant) rate of interest, that is just low 
enough to offset completely the higher time preference. The result is 
that, with insurance, consumption grows at about the same rate under 
uncertainty as under certainty.° 

It is important to note that Yaari’s conclusions concerned only the 
shape of the lifetime consumption path and not its level. Levhari and 
Mirman (1977, referred to as LM below) have extended Yaari’s work 
by analyzing levels in the absence of insurance. They assume constant 
rates of subjective discount and relative risk aversion, giving: 


4 The rate of return on an annuity of a year’s duration would equal the rate on 
regular savings plus the probability of death during that year (see Yaari 1965, p. 144). 

> The actual results are that C(t) = —[r + a(t)|(U'/U") under both certainty and 
uncertainty with insurance, while C(t) = —[r + a) — Pt | t)\(U'/U") under uncertainty 
without insurance. (A dot denotes a time derivative and a hat a proportional rate of 
change with time; U’ and U" are first and secorid derivatives of U with respect to C[t].) 
Since C(t) will differ in the three cases, and U'/U" is in general not constant, we can say 
only, loosely, that under uncertainty consumption tends to grow at the same rate as 
under certainty with insurance but more slowly without. 
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Cir 


ea dt. (2) 


T 
V =| P(r|t) exp [—p(r — t)] 
Here —y is the elasticity of marginal utility with respect to consump- 
tion, and y is the index of relative risk aversion. The elasticity of 
intertemporal substitution in consumption, @, is equal to 1/y. 

Maximizing (2) when resources are composed exclusively of non- 
human wealth, LM find that the impact of uncertain lifetime is a 
priori indeterminate. In the special case where r = p = 0, initial 
consumption will be higher than under certainty if y < 1 and lower if 
y > 1. Whenr and p are nonzero, there are a number of distinct cases. 
It is argued in Section IV of this paper that the most “realistic” is 
where y > 1. In this case, unless r or p (or both) is very high, increased 
uncertainty always leads to a decline in initial consumption.® As shown 
below, this conclusion does not extend to the case where resources are 
largely in “human” form. 

Barro and Friedman (1977) have drawn attention to the fact that, 
under (2), with actuarially fair insurance, consumption grows at the 
same rate under uncertain lifetime as under certainty. They have also 
pointed out that if earnings did not change with age, andr and p were 
zero, consumption would be constant throughout life at the same level 
under certainty and uncertainty. This special result appears to be the 
only available analysis of the impact of uncertain lifetime on the level 
of consumption, in the presence of insurance.’ 

These previous contributions might be criticized for focusing either 
on a world without insurance and annuities or on one in which they 
are actuarially fair. Both situations are clearly “unrealistic.” Must we, 
then, model insurance with an equilibrium load to make further 
progress? In the present context, the answer is no. Aside from con- 
siderations of analytical difficulty, the exercise is not necessary if we 
wish to discover whether the life-cycle model without a bequest motive 
can explain the slow dissaving of the old. Nonpension annuities are in 
fact of little importance for savers of any age.* There are two possible 


6 Levhari and Mirman 1977, p. 275. Note that LM did not regard this case as 
especially significant. In fact, in their abstract they state “the major result is that if the 
utility function is Cobb-Douglas and the rate of return is not too large relative to the 
amount of future discounting then lifetime uncertainty will always increase consump- 
tion,” which refers to the case where y = 1. 

7 An appendix to this paper which analyzes the case with actuarially fair insurance is 
available from the author. The illustrative computations show that, although, theoreti- 
cally, a positive or negative impact of uncertain lifetime on consumption is possible, a 
slight negative impact is likely at all ages. There is no tendency, however, for this to be 
more severe for the elderly. 

8 In the 1962 Survey of Financial Characteristics only 1 percent of households reported 
any private annuities, and on average these composed less than 0.1 percent of the 
household portfolio. Surprisingly, the figures are not any higher for those aged 65+ 
(Projector and Weiss 1966, tables A8 and A31). 
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explanations. Either there is a widespread strong desire to make 
bequests, or insurance markets fail to provide annuities sufficiently 
attractive to outweigh the greater transactions costs and incon- 
venience of saving in this form.? Thus the only case in which the 
life-cycle model without a bequest motive could explain the low con- 
sumption of the old is that where regular saving instruments gen- 
erally dominate annuities. Accordingly, for analytical simplicity it 
is assumed here that regular savings always dominate annuities. 


III. The Impact of Uncertain Lifetime at Different Ages 


The object of this paper is to determine the impact of uncertain 
lifetime on consumption at different ages. One might suppose the 
appropriate procedure would be to compare complete lifetime plans 
under certainty and uncertainty. Such a comparison would not, how- 
ever, tell us the impact of uncertainty on an individual of arbitrary age 
with given current wealth and other characteristics. It would indicate 
only differences in projected consumption at various ages for indi- 
viduals with given wealth at the beginning of their lifetimes.! 

The impact of uncertain lifetime on consumption by an individual 
of aget with wealth, W(t), and a noninvestment income stream, FE (T),¢ 
<7 <T, can be measured by the difference between what he would 
consume if his lifetime were certain, C¢(t), and if it were uncertain, 
C(t). (The argument ¢ will be omitted in the text below when no 
ambiguity arises.) In making this comparison we must vary only the 
degree of uncertainty. In particular, the expected lifetime, 


T 
p(t) =f P(r|t)dr +t, (3) 


must be held constant. 


Consumption under Certainty 


Under certainty a consumer of arbitrary age, t, would maximize (2) 
subject to the constraint that discounted consumption up to p(t), K“(¢), 
could not exceed total current resources, R(t). Resources would be 
composed of current wealth, W(t), and the discounted value of nonin- 


° There is a significant problem of adverse selection since survival chances clearly 
differ widely but may be quite difficult for an insurance company to ascertain. 

10 As mentioned previously, Yaari (1965) showed that under uncertainty consump- 
tion tends to grow less quickly than under certainty. Thus, irrespective of the difference 
in initial consumption (say, at age 20), by retirement, consumption under a complete 
lifetime plan will almost certainly be lower under uncertainty. This has no bearing, 
however, on whether an individual aged, say, 65, of given wealth, consumes more or 
less than he would under certainty. 
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vestment income, E(t), t <7 < p(t), denoted H“(t). Adding the termi- 
nal condition W[w(t)] = 0, we have the constraint: 
M(t) 
W(t) + H°(¢) — C%(7) exp [—r(r — 1) dr = 0, (4) 
t 
or 
R¢(t) — K°(t) = 0. 


This is a standard problem in optimal control. Consumpton will grow 
according to 





Ch) Pres g, (5) 
7 
where the hat denotes a proportional rate of change with time (as it 
will throughout). Equation (5) has the obvious solution: 


C¢(r) = exp [g(r — t)IC“(@). (6) 


Substituting (6) into (4) we obtain:” 


Mo) = 
Co(t) = if Exp (21) itr} Ret) =PCe@)Re@. (7) 
t 
The term PC‘, the average “propensity to consume” out of resources, 
is just PC°(¢) = [K¢(t)/Ce(t)|-! = K¢(t)~1, that is, the inverse of dis- 
counted lifetime consumption normalized by initial consumption. It is 
helpful to view PC® in this light since it is usually easy to see how a 
change in uncertainty affects “normalized” lifetime consumption. 
The latter simply reflects the shape of the consumption path and the 
strength of discounting. 


Consumption under Uncertainty 


Under uncertain lifetime, the consumer maximizes (2) subject to the 
constraint that wealth may never become negative. (To lend without 
security to a person whose lifetime is uncertain would be to provide a 
form of insurance.) As Yaari pointed out, under this constraint the 
optimal plan consists of a series of intervals where the constraint is 
effective and others where it is not. I refer to these as “blocked” and 
“free,” respectively. In the LM analysis, where all resources are in the 
form of an initial stock of nonhuman assets, the entire lifetime is a 
free interval. Here matters are more complicated. 

Over blocked intervals we have, of course, C“ = E. In addition, it is 
clear that the consumption plan is continuous at the beginning, ¢*, 


"Eq. (7) is not simplified, in order to emphasize its similarity to the corresponding 
expressions under uncertainty, eqq. (10) and (13). 
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and end, ¢* + m, of any free interval, with C“(t*) = E(t*) and C“(t* + m) 
= E(* + m).” Over the course of a free interval, consumption 
fae according to 


OM) = rp + Pol (8) 


(Yaari 1965, p. 143).*% That is, consumption grows less quickly than 
under certainty because expected utility from future consumption 
declines without any corresponding drop in price. Solving (8), we 
obtain: 


C¥(r) = P(r|t)"!” exp [g(r — OIC"), * <t <7 <t* +m. (9) 
Recalling that wealth must be exhausted over the interval, we have: 


me?” E(t) exp [-r(t — ¢*)]dt 


= ee (10) 
mo” P(t | t*)"” exp [(g — r)(t — ¢*)]dt 


Curr) = 


Equations (9) and (10) completely specify the free interval when t* + 
m = T. In the case where t* + m <T, we have a further relationship. 
Since C«(*) = E(t*) and C«(® + m) = E(t* + m), given (9), E(¢*) and 
E(t* + m) are related as follows: 


E(t* +m) 


OG PUG +m) |” exp (gm) we 


An indication of the range of plans that may be obtained with 
realistic E and P paths and plausible values of r, p, and y is provided 
by figure 1. The plans shown are the result of the illustrative compu- 
tations of the next section. Note that in all cases life begins with a 
blocked interva! and there is just one free interval.’* Part A indicates 
that with E = 0 beyond age 65 (no pensions), the free interval would 
extend until T. That is, wealth would not be run to zero as long as 
there was a nonzero probability of surviving any longer. In part B, the 
consequences of realistic pensions are shown. With a high y, the 
consumption path is relatively flat (see [9]) and a free interval lasting 


eats (11) 


2 Tf C“(t*) < E(t*), expected utility could be increased by raising consumption at ¢* 
and reducing it just before. A similar argument holds for t* + m. 

13 Note that, since P(r|7) = 1, P(r) = P(r|7). 

14 In general, the consumption plan will not always have these features. The initial 
blocked interval is caused here by E being low at the outset. A second free interval does 
not succeed the first because, beyond ¢* + m, the desired growth rate of consumption is 
declining because of the rapid fall of P, whereas pensions (if present) fall at a constant 
rate of 2.25 percent per year. (See the next section for an explanation of this rate.) An 
increase in the ratio of the growth rates of desired consumption and E would be 
required for a second free interval to emerge. 
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A. Without pensions 


Thousands of dollars 


Thousands of dollars 





20 40 60 80 100 
Age (years) 


Fic. 1.—Selected optimal consumption paths, r = .03, p = .015 


until T is obtained, whereas with y = 1 the desired growth rate of 
consumption declines quickly and a second blocked interval occurs. 


Behavior of C*/C* if Free Interval Lasts until T 


How does C“ compare with C°¢ at different ages? Consider, first, 
situations where the free interval lasts until 7. At any point in the free 
interval, discounted consumption up to T cannot exceed total “re- 
sources,’ R”, that is, the sum of current wealth and the present value 


of the E stream up to T, H“. Adding the terminal condition W(T) = 0, 
we have: 


W(t) + H(t) mle C“(r) exp [—r(r — t)]dt = 0, (12) 
or R(t) — K“(t) = 0. Substituting (9) into (12), we obtain: 
T -1 
CHt) = J Peo exp lig ner - De| RX) = PCXOR(). 


(13) 
Again, PC“(t) = K%(t)"}. 
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TABLE 1 
ANALYSIS OF PC“/PC° 











g and r y PC"/PC°¢ 
=< >1 if y sufficiently below 1, 
otherwise <1 
re Ee a = al 
=I <i 
| <1 Z| 
eS = x = =I 
>1 << 
<1 >1 
e <9 = Sal 
= <1 if y sufficiently large, 


otherwise >1 





The behavior of C“/C¢ is determined by that of R“/R° and PC“PC°. 
When the free interval lasts until T, in the presence of pensions we 
have R“/R* > 1. However, as will be shown, we may have PC“/PC¢ < 1, 
so that C“/C° < 1 is a possible outcome. 

From equations (7) and (13), we see that PC“PC* depends on the 
relative size of g and r and the value of y. In Section IV it is argued 
that y is typically considerably greater than unity. It is observed that 
apparently non-credit-constrained households consume at an in- 
creasing rate. As long as utility is additively separable over time, this is 
only consistent with an interest rate greater than the subjective dis- 
count rate.’° Thus realistic restrictions include y > 1 and p < r. 
Further, if y is typically considerably above unity, unless p is far below 
zero, which seems unrealistic, we have g < r. 

Levhari and Mirman have provided a full analysis of the relative 
values of PC” and PC*. Their conclusions are set out in table 1. When 
g =rand y = 1, recalling (3), we find that (7) and (13) imply that PC“ 
= PC*. Lowering y below 1 raises 1/y and increases PC“/PC*, while 
raising it does the opposite. With g > r and y = 1, PC“/PC* < 1] since 
greater relative weight is placed on the more remote future in the 
denominator of PC“ than in that of PC*. Raising y simply reinforces 
this. On the other hand, although reducing y below 1 tends to in- 
crease PC“/PC‘, the ratio will not rise above unity until y is sufficiently 
small, so that a simple result is not obtained. 

Now, in the apparently realistic situation where g < r, we have 
PC“/PC¢ > 1 with y < 1, again because more weight is placed on the 
more remote future in the denominator of PC”. However, raising y 

15 In Yaari’s framework, which does not require constant risk aversion or constant 


subjective discounting, consumption will not rise unless the rate of return exceeds the 
subjective rate of discount; see Yaari (1965), pp. 143, 147. 
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above unity reduces 1/y and tends to make PC” fall. Clearly, with 
sufficiently large y we obtain PC“/PC° < 1. In the next section we find 
that with actual earnings and survival data, y does not have to be far 
above unity for this result to occur. 

In order to study changes in PC“/PC* with age, one may compare 
the proportional rates of change:'® 


PCE) = PC()(1 — exp L(g — Nlu@) — Ma) + (g-7), (4) 


and 
POM) = PCM) + = + (g—7). (15) 
; sae ON nO 
To interpret these relationships, it is helpful to note that PC’ = —K'. 


Consider first a situation where the lifetime was actually fixed and g = 
r. In this case, normalized lifetime consumption, K', would simply 
equal the remaining life expectancy, y(t) — t. Each year K‘ would fall 
by a percent equal to its own inverse. Hence PC' = (K')~! would rise by 
this same percent, that is, PC’ itself. 

When allowing g # r, say g <r, we give K' some tendency to rise as t 
increases since weaker discounting (compounding, if g > 1) is applied. 
This gives rise to the (g — r) in both (14) and (15). Finally, in fact 
lifetime is not fixed. Under certainty, the extension in possible 
lifetime increases normalized lifetime consumption, reducing PC*, 
while under uncertainty, the increased probability of survival to any 
future date also raises K‘ and reduces PC’. 

Equations (14) and (15) show that, in the main, whether PC“/PC¢ 
tends to rise or fall depends on whether it starts out greater or less 
than unity. When PC“/PC* < 1, a decline may not necessarily occur. 
However, since p(t) and P(t) are both typically quite small, in our 
“most realistic” case, of g <r and y well above unity, it is likely that 
PC“/PC© will be low enough to ensure its continuous decline. Whether 
this translates into a decline in C“/C¢ depends entirely on the behavior 
One 

In the absence of pension income, since most people retire before 
the age at which death was expected, even from the viewpoint of the 
start of adult life, R“ is typically the same as R°. Hence everything said 
above about PC"/PC* applies to C“/C°. That is, with the most plausible 
parameters C“/C° < | and declines monotonically with age over the 
free interval. In addition, it is clear that C“/C* < 1 during the initial 
blocked interval (C” will be less than the “free” value given by eq. [13)]), 
although the ratio would likely typically increase with age prior to ¢* 
since E rises quickly for the young. (Recall C“ = E in the blocked 


'6 In deriving (15) let 7(7) = the unconditional probability of surviving to ager. Then 
P(r|t) = a(z)/r(t), and OP(r|t)/at = —a(r)m(t)2r(t), or —P(r|t)P(2). 
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interval.) These conclusions are particularly interesting since many of 
the studies indicating slow decumulation by the old were conducted 
when pensions were less sizable and widespread than they are today. 
When pension income is present, we of course have RX/R¢ > 1; and 
even when the free interval continues to last until J, the above conclu- 
sions will hold only with a higher value of y. In the next section we 
find that the required level of y is not, however, implausible. Here we 
should note that, although R“/R¢ > 1, it has some tendency to fall with 
age. The R' has two components, W and H’, the former of which we 
hold constant in the comparison of certainty and uncertainty. Since 
most often W is rising and H' is falling, RY and R¢ have a basic 
tendency to become more equal. In the next section we find that in 
practice this dominates, and R“/R¢ typically falls with age. It should be 
noted, however, that the behavior of H“/H° is in opposition. We have 


E(t) 


A“(t) =r-—- Ho” 


(16) 





and 


H(t) 
Since E [pz (t)] exp { —r[(t) — t]} a(t) is usually small relative toF (¢) and 
H"“ > H°, this means that H“ tends to fall more slowly than H°. 


Behavior of CY/C° if Free Interval Ends before 'T 


As we saw in figure 1, in the presence of pensions the free interval 
may terminate before T. In this case, it is likely that C“/C° > 1 
throughout the free, as well as the following blocked, intervals. 

In the computations of the next section, in cases where the free 
interval ends before T, we usually find that p(¢*) < ¢* + m. Hence p(t) 
must rise and become equal to ‘* + m at some point in the interval. At 
this age C” exceeds C°. In general, we have: 


W(t) + fo” E(x) exp [—r(r — tld 


C(t) = aT = ht (18) 
J. P|” exp Ug — nr — ddr 
and 
Lt) i 
Fee eae ESD TST pes ree (19) 


f° exp le — rr — dr 


For p(t) = t* + m relevant resources are the same, but the de- 
nominator is smaller under uncertainty, and C“ > C°. 
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What can we say about C“/C¢ at other points? Before the age where 
p(t) = t* + m, the longer horizon under uncertainty makes the 
denominator of (18) larger relative to that of (19). However, relevant 
human resources in the numerator of (18) are greater than those in 
(19). Similarly, above the point where p(t) = ¢* +m, the denominator 
of (18) falls relative to that of (19) but the numerator does the same. 
In the illustrative computations, the net result is often: thats =.C" 
throughout the free interval. 

Finally, after the free interval we are likely to have C“/C* > | again. 
To see this, consider a case where ¢* + m occurs in retirement and the 
pension changes at a constant percentage rate. As long as g exceeds 
this rate, which appears likely, positive saving will be desired under 
certainty, andiGo <=) E =1C% 


IV. Quantitative Impact 


This section provides a quantitative study of the likely impact of 
uncertainty on consumption at different ages. It does this by com- 
puting hypothetical consumption plans under certainty and uncer- 
tainty at different ages, using actual data on noninvestment income 
and survival probabilities and a range of values ofr, p, and y. The first 
step is to compute the complete lifetime consumption plan under 
uncertainty. This generates a time path for W(t). Next, at intervals of 
1 year I ask how much a person would consume, given the W() 
computed in the first step and a certain lifetime equal to life expec- 
tancy at that age. 

Figures 2 and 3 plot the data used. An inverse logistic curve fits the 
P path in figure 2 well and allows the use of continuous time in the 
computations.!” Similarly, the lifetime path of mean noninvestment 
income in figure 3 is approximated from ages 20 to 70 by a fourth- 
order polynomial.’* Beyond age 70, it is assumed that E declines at the 
observed trend rate of 4.25 percent per year. It is also assumed that 
secular growth shifts the entire cross-section profile upward at a rate 
of 2 percent per year. This explains why the FE path for an indwidual 
shown in figure | differs from the cross-section profile of figure 3. 

The computations were carried out with alternative values of r, but 
I focus mainly on the case where r = .03, suggested by popular 
estimates of mean household rates of return (see, e.g., Boskin 1978, p. 
S19; Feldstein, Green, and Sheshinski 1978, p. S64). The subjective 


a The curve P (t) = [(1 + a) exp (—dt)]/[1 + a exp (ct)] was fitted by least squares, using 
numerical methods (R* = .999). Values of the parameters are a = 1.00031, b = 
—0.002114, and c = 0.14651. 

18 The curve, E(t) = —36,999.4 + 3520.22t — 101.8782? + 1.34816t3 — 0.0070623324, 
was fitted by ordinary least squares (R? = .999). 
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Probability of survival to given age 


Oo 
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Age (years) 
Source: Statistics Canada (1974, pp. 14-15) 
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Source: Revenue Canada (1973, pp. 50-59) 


Fics. 2-3.—Fig. 2, Male survival probabilities (Canada 1971). Fig. 3, Average male 
noninvestment income (Canada 1971). 


rate of discount is limited to the arguably “realistic” range 0 S$ p <r. 
(It was suggested in the previous section that p > r is implausible, 
while allowing p < 0 simply reinforces the results obtained below.) 
The parameter y is allowed to vary from 0.5 to 5.0, reflecting both a 
desire to cover the alternatives y = 1 and considerable evidence that y 
is typically much above unity. 

A variety of evidence on y is available. Friend and Blume (1975, p. 
920) estimated it in the context of risk aversion, using data on house- 
hold asset holdings, and found that it was likely to be in excess of 2.0. 
Farber (1978), attempting to infer the preferences of members of the 
United Mine Workers from the results of collective bargaining in the 
coal industry, also estimated y as the’ index of relative risk aversion, 
obtaining alternative estimates of 3.0 and 3:7 (p.935). Further éyi- 
dence is provided by Ghez and Becker (1975, p. 133) and MaCurdy 
(1979, p. 34), who, in studies of the allocation of goods and time over 
the life cycle, estimated upper bounds on o of .28 and .33, re- 
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TABLE 2 


C¥/C° wiTHouT PENSIONS 

















y 
ne r= .015 = .045 

AGE 5 1 2 3 4 5 4 4 

p=0 

20 1.060* 441* 306% .274* 260" .252* .236%* 285% 

30 1.264 1.000 903 872 857 835*  .794 886% 

40 1.305 1.000 875 833 811 .798 748 863 

50 1.375 1.000 832 775 746 728 684 800 

60 1.498 1.000 al 695 656 633 .600 .709 

70 1.678 1.000 702 609 564 538 518 .609 

80 1.815 1.000 .668 571 525 .499 .490 559 

p=r/2 

20 441% 306* .260* .247* .241* 937% 996% 257% 

30 1.216 986 857%  .820* .802*  .792*  .798* .810* 

40 1.261 1.028 903 855 830 814 .760 885 

50 1.342 1.032 862 .798 765 744 695 823 

60 1.481 1.038 802 718 674 648 610 733 

70 1.675 1.041 731 629 579 550 526 630 

80 1.823 1.039 691 586 537 508 .496 576 

finial Sa MEAT Perit Nii dak S408? Ree ene ian 

20 294% 294% 994% 994% 994% 994% 917% 234% 

30 175 2* 752% 752% 752% 759% 759% 76 9* 743% 

40 1.064* 1.044 927 876 847 829 B7i7il .904 

50 1.293 1.054 889 820 782 759 .705 845 

60 1.446 1.067 830 739 692 662 619 756 

70 1.657 1.076 758 649 594 562 533 652 

80 1.821 1.074 714 602 548 517 502 593 





* Nonnegative wealth constraint effective under uncertainty. 


spectively, implying y > 3.6. Finally, recent studies of the elasticity of 
aggregate saving with respect to the real rate of interest suggest high 
values of y as well. Boskin (1978, p. S4), for example, estimates the 
uncompensated elasticity to lie in the range 0.3—0.4 in the United 
States. Including pensions, with r = .03 and p = .015, my model 
implies uncompensated aggregate saving elasticities of 0.9, 0.5, and 
0.3 with y = 3, 4, and 5, respectively. 

To sum up: The available evidence suggests that y may typically be 
above 3; that a value of 4 may be a “best guess” at the true figure; and 
that even y = 5 could not be rejected out of hand as implausible. 

Table 2 presents the illustrative computations for the case where E 
is set to zero beyond age 65, that is, where there are no pensions. A 
value of y below 1 gives C“/C* > 1 and rising with age in the free 


** Alternative values of r and p yield similar results. Using y = 4, with r = .06 and p = 
-03 the elasticity is 0.6, while with r = .03 and p = .0 we get a figure of 0.4, e.g. 
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TABLE 3 


C“/C© wtTH PENSIONS 









































Y 
r= .03 r=.015 r= .045 

AGE 5 ] 2 3 4 5 4 4 

p=9 
20 .997* .415* .288* 2 Oe AD ee 31/7 .274* 
30 1.346 1.089 .909* .827* .790* .769* .740* .836* 
40 1.397 1.109 .992 -946 .922 .908 915 .935 
50 1.491 1.145 .988 .922 .888 .867 .886 .899 
60 1.669 2 .983 885 835 .805 .842 .840 
70 1.965 337 .994 844 IP .730 .795 .766 
80 1.808 1.500 1.065 843+ .7487 6967 = .793F .726F 

p=r/2 
20 -415* .288* .245* Tye PETE Sass SAW if .247* 
30 We232% .909* .790* HS SE KO erAlle .764* 
40 1.330 1.168 1.043* .974 -945 .926 .929 .960 
50 1.437 1.178 1.030 .959 .917 891 .904 .933 
60 1.618 1.266 1.046 .937 .874 835 .865 888 
70 1.705 1.397 1.090 917 .822 .767 .825 827 
80 2227 1.230 1.193 9437 .807F .737¢ ~ .830T 7984 

p=r 
20 .210* .210* .210* PAU Pale — Pile — allege .224* 
30 .693* .693* .693* .693* .693*  .693*  .688* .701* 
40 .935* .935* .935* .935*  .935*  .935* .943 .925* 
50 1.138* 1.138* 1.056 991 .944 .914 .921 .964 
60 1.398* 1.263 1.089 .985 .912 .866 .889 .934 
70 Pai23* 1.318 1.154 .992 875 .807 .857 895 
80 1.088* 1.088* 1.220 1.053 876 =.784¢ 8 71F 8877 





* Nonnegative wealth constraint effective under uncertainty. 
+ No blocked interval succeeds free interval. 


interval, while y > 1 yields the opposite, as predicted in the previous 
section. With y = 3 dramatic results are obtained. Using the “best 
guess” figure of y = 4 and the combination r = .03 and p = .015, for 
example, maximum negative impacts on consumption are 23.5 and 
46.3 percent for the middle-aged and old, respectively. With the 
same, quite conservative, parameters, the mean ratio of dissaving to 
net worth for those aged 65-85 declines from 9.0 to 3.7 percent 
with the introduction of uncertainty. Note, finally, that the results are 
robust with respect to changes in 7. 

Table 3 shows that when noninvestment income in retirement is 
introduced, the ratio C“/C¢ rises in most cases. However, with y 2 3 
and p <r, the impact of uncertainty remains negative and much 
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larger for the retired than for the middle-aged. When we focus again 
on the r = .03, p = .015, y = 4 case, we see that, while uncertainty 
reduces consumption in middle age by up to 8.3 percent, it depresses 
that in retirement by up to 19.3 percent. Although this is a more 
modest effect than obtained without pensions, it is still impressive, as 
shown by the fact that with the same parameter values the average 
rate of decumulation between ages 65 and 85 falls from 7.0 to 2.9 
percent with the introduction of uncertainty.”° Furthermore, isolation 
of the r = .03, p = .015, y = 4 case is conservative. Small changes in 
parameter values can make the effect even more sizable. By reducing 
p to 0 or by raising y to 5, for example, we increase the maximum 
negative impact on consumption for the elderly to about 25 percent 
and reduce the mean rates of decumulation under uncertainty for 
those aged 65-85 to 2.4 and 2.1 percent, respectively. Finally, note 
that the results are again robust with respect to r.?! 


V. Conclusion 


This paper has shown that the life-cycle model without a bequest 
motive can help to explain one of the most interesting features of 
actual consumption behavior, the typical low rate of dissaving among 
the elderly, when uncertain lifetime is taken into account. Theory 
shows that, with the most plausible parameters, a negative impact of 
uncertain lifetime which increases in proportional severity with age is 
likely in the absence of pensions and may also occur when pensions 
are present. Illustrative computations based on actual income and 
survival data show that this effect is, in fact, obtained with or without 
pensions. Furthermore, the magnitude of the impact is sufficient to 
suggest that uncertain lifetime could provide the major element in a 
complete explanation of the slow decumulation of the retired. 
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What Kind of a Science Is Economics? 
A Review Article on Causality in Economics 
by John R. Hicks 
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As computational costs have declined, economists have used increas- 
ingly complicated statistical models in analyzing business cycles. Most 
of these models have been based on extensions of the IS-LM 
framework developed by Hicks, combined with the simultaneous 
equations statistical framework which was the central subject matter 
of econometrics. These large econometric models were, and still are, 
successful in certain important respects. By making possible the sys- 
tematic use of large amounts of statistical information and the 
codification of forecasting and projection procedures, they provided 
good forecasts—as good as or better than averages of “judgmental” 
forecasters. The models and their services have become commercially 
valuable. Nonetheless, they seem to many economists to have out- 
grown their underpinnings in economic and statistical theory. 

The economic theory entering the models does not in itself explain 
the existence of recurrent business cycles. In practice, the fitted mod- 
els usually explain much of what might be labeled “the character of 
the business cycle” as a consequence of properties of the “error terms” 
in the model about which theory has little to say. This is not entirely 
the fault of the economic theory. The simultaneous equations statisti- 
cal theory makes heavy demands in this area of application on sup- 
posed a priori knowledge. It is therefore perhaps not surprising that 
the actually available economic theory leaves large gaps to be filled 
with conventional or ad hoc assumptions. The burden of ad hoc 
assumptions and implicit statistical procedures carried by the models 
for these reasons weakens them somewhat as tools for forecasting and 
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policy projection. It weakens them still more as scienfific instruments 
when important economic issues are subject to dispute. 

Reflecting these sources of dissatisfaction are two current devel- 
opments in macroeconomic research. On the one hand, some 
economists are building stochastic theoretical models which have test- 
able probabilistic implications for the behavior of major aggregate 
time series as stochastic processes. On the other hand, econometri- 
cians have been looking at methods for reducing the ad hockery of 
the standard methodology by formalizing the procedures for control- 
ling the dimensionality of models and for accounting for drift in 
model structure. 

These developments seem to me natural and hopeful. They do, 
though, point to an increasingly complicated and intense interaction 
between theory and data, using probabilistic models. What is going on 
is not quite like what has gone on in any other science, though one can 
see some analogies. Is what is going on useful? What are the criteria 
for confirming or disconfirming models of this type? Are the formal 
and informal standards for reporting results in this area reasonable? 

According to its introduction, the book under review grew out of 
Hicks’s meditation on a 1974 conference on the microfoundations of 
macroeconomics, which he attended and saw as a failure. It also 
reflects his view that “econometrics is now in some disarray.” The 
book discusses scientific method in general and then in this light the 
special character of economics, especially macroeconomics, as science. 
As is evident from the book’s length (124 + xii pages), the argument is 
concise, and it is presented in a conversational tone. The central 
conclusions, as I see them, are that “as economics pushes on beyond 
‘statics’ it becomes less like science, and more like history” (p. xi) and 
that “the probability calculus is no excuse for forgetfulness” (p. 122). 

To explain completely what Hicks means by these phrases would 
require more space than I have in this review, but some explication 
is obviously required. The former idea is that dynamic economic 
theories must inherently be incomplete, imprecise, and therefore 
subject to variation over time. One reason for this is that economic 
cause-effect relations involve a “recognition delay” about which 
theory has little to say and which may be expected to be variable. For 
example, when a price change causes a change in an amount pur- 
chased, there will be a delay between the actual price change and 
consumers’ hearing of it, as well as a possible further delay before 
consumers decide to act on the information. It is wrong, then, to 
expect economic theories to be complete, mechanical, and divorced 
from reference to specific historical circumstances. 

The latter idea is that, particularly in economic time series, prob- 
ability models are often inappropriate because the data are interde- 
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pendent in ways which make treating them as “random” patently 
unjustifiable. Hicks argues that probability judgments in economics, 
both in microeconomic decision making and in evaluation of com- 
peting economic theories, often result in incomplete orderings, with 
many possibilities lying in a gray zone of noncomparable probabilities. 
Thus the usual axiomatic arguments for the probability calculus are 
unjustified in a wide range of economic applications. He cites as an 
example of this objection to the probability calculus in inference the 
situation in which a new fact which seems not to fit existing theory 
emerges, and a new but ad hoc theory is put forward to “explain” it. 
Because a priori and likelihood considerations point in different 
directions in this case, he argues that the two theories lie in the gray 
zone of noncomparability and that the reasonable action to take is to 
reserve judgment. Similar considerations, Hicks argues, suggest that 
economic behavioral models of decision making under uncertainty 
ought not to be formulated taking the usual probability calculus for 
granted. Probabilistic models and methods of inference are often 
awkward and misleading tools in application to economics. 

Hicks’s discussion is valuable for the insights it presents into the 
special character of economics as a science. Its conclusions, though, 
seem to me deeply mistaken, in part because the argument too easily 
resolves some of the problems it raises. 

My dissatisfaction with the Hicks book is easier to explain if we 
compare it with another recent book on a closely related subject— 
Chance, Cause, Reason, by Arthur W. Burks (1977). The Burks book is 
expansive (almost 700 pages), where Hicks’s is concise; it is densely 
formal in parts, where Hicks’s is persistently informal; it very seldom 
gives the impression of having chosen an easy resolution of a 
difficulty, where Hicks’s does. Burks, for example, considers at some 
length the objection to the probability calculus given central place by 
Hicks: that probability judgments do not yield a complete ordering. 
Burks argues that these objections are correct in principle but that 
they are in the nature of qualifications to the range of applicability of 
the calculus—when no decision has to be made, or when the decision 
to be made is unimportant, then it may not be “economic” (to use 
Burks’s word) to rank all probabilities and utilities. But (this is my 
conclusion, not Burks’s) in important issues of scientific inference or 
in economic decision making which involves substantial amounts of 
money and utility, it is both normatively and descriptively reasonable 
to treat probabilities and utilities as completely ordered. In taking 
decisions, we implicitly rank them in any case. 

I could go on citing Burks in rebuttal to parts of Hicks’s argument 
with which I disagree, but it would be better for the reader of this 
review to read Hicks and struggle with Burks (I for one cannot “read” 


REVIEW ARTICLE 581 


Burks in the ordinary sense of the term) and reach his own conclu- 
sions. For despite my sympathy with Burks’s style and with many of 
his arguments, I do not see his book as in any sense a substitute for or 
a refutation of Hicks. Burks is constructing a model of scientific 
inference, and it seems that the hard sciences are the focus of his 
interest. He rightly points to the need for some degree of uniformity 
in scientific laws over space and time if inference is to be possible and 
also rightly (in my view) claims that his model of inference applies 
even if there is some variation of laws in space and time. He rightly 
claims that there is a sense in which empirical probability statements 
(e.g., “the probability that this coin will come up heads is .5”) can be 
true, and from this point builds up an intersubjective (as opposed to 
Savage’s personalistic) theory of inference. Economists do often aim 
at influencing professional consensus as to what causal structure 
plausibly underlies historical data. If we are to think about how 
economists do this or ought to do this, an intersubjective theory like 
Burks’s fills an important gap in Savage’s subjective theory. 

But Hicks is right about some of the central differences between 
economics and other sciences—temporal instability of our theories is 
not a marginal qualification to our methods of inference but a central 
aspect of our difficulties in determining what is true. And treating 
“empirical truth” as a primitive notion seems too simple to lead to a 
theory applicable to economics. The problem of the “ad hoc theory,” 
arising from the dearth of “empirically true” probability statements in 
economics and the near impossibility of generating them by experi- 
ment, is central to us. Burks’s system, while relevant to economics, 
fascinating on its own terms, and stimulating as an example, does not 
confront in detail some of the central problems of inference in eco- 
nomics. 

Hicks is not explicit about how his broad conclusions apply to 
specific types of current work in economics. (Indeed, except for the 
author’s own works, no publications in macroeconomics, probability, 
or statistics less than 15 years old are cited.) He does in his final 
paragraph appear to condemn all use of probability methods on cross 
sections not generated by random sampling: 


Thus it is not at all sensible to take a small number of 
observations (sometimes no more than a dozen observations) 
and to use the rules of probability theory to deduce from 
them a “significant” general law. For we are assuming, if we 
do so, that the variations from one to another of the obser- 
vations are random, so that if we had a larger sample (as we 
do not) they would by some averaging tend to disappear. But 
what nonsense this is when the observations are derived, as 
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not infrequently happens, from different countries, or lo- 
calities, or industries—entities about which we may well have 
relevant information, but which we have deliberately de- 
cided, by our procedure, to ignore. [Pp. 121-22] 


I think Hicks is wrong here. Any use of experience to reach better 
decisions must involve “deliberately ignoring” information, whether 
the experience is the historical record or a sequence of repeated 
experiments. In physical experiments there is always observational 
error, which is treated as random. A good experimenter may have 
many bits of information he judges relevant to explaining the appar- 
ent random error in a sequence of experiments. In writing up the 
experiments, though, he does not report which research assistant was 
on duty at each meter reading, together with an analysis of each 
assistant’s bias and mean square deviation as a meter reader. In 
physical sciences there is substantial consensus as to what is to be 
treated as random. 

In social sciences, with history the most extreme instance, there is 
much less consensus. It is nonetheless necessary to ignore some in- 
formation, to treat it as unsystematic or random, if the past is to be 
useful. Because there is inevitably room for dispute about what is 
systematic and what is not in analyzing history, historians do not 
dogmatically suppress discussion of information which some might 
consider “random” or unsystematic. Nonetheless, when a decision has 
to be made, if history is to be relevant, some choice must be made as to 
what is random. A political decision maker might well need to judge 
how likely it is that a Marxist revolution in his country would be 
followed by political democracy. He might think that the historical 
record concerning previous Marxist revolutions is relevant to this 
question. However, it is obvious that every previous revolution has 
special characteristics which are well known and which are potentially 
relevant to explaining whether the postrevolutionary government 
emerged as politically democratic. In applying the historical record to 
his own problem, the decision maker will have to treat at least some of 
these special characteristics as random. 

The same type of argument justifies, in clinical trials of a new drug, 
reporting statistical summaries of the outcome of therapy rather than, 
case by case, the best guess of the attending physician about why the 
result of therapy was what it was for this case. And the same type of 
argument again justifies using statistical methods for observation on 
countries Or regions even when we could adduce a great deal of 
special information about each observation which might possibly help 
explain its “random” component. 

Economists must inevitably try to sort out systematic patterns from 
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random variations in the past—if only because, unassisted, policy- 
makers would do the same thing more naively. In doing so econ- 
omists will need probabilistic models and statistical methods of 
inference. Like historians, though, they must accept that a single 
agreed view of the causal structure of the record they examine will 
never emerge. Perhaps Hicks’s stimulating but unsatisfying book will 
lead someone to work out a philosophy of inference which confronts 
both these aspects of economics as science. 
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To Save or Savor: 
The Rate of Return to Storing Wine 


Elizabeth Jaeger 
University of Virginia and Freemark Abbey Winery 


For $28,000 one can buy a Jaguar XJ6, 308 shares of McDonnell 
Douglas stock, a one-tenth interest in an Oklahoma gas well, 43 
years of undergraduate life at Stanford, or one bottle of Chateau 
Lafite, vintage 1806. “That’s $1,166.67 an ounce. At this moment gold 
is worth only $275 an ounce, and you can’t smell or drink it” (Saturday 
Review of Literature 1979, p. 38). Nine years ago, $70 could persuade a 
Paris restaurant to part with this same bottle of liquid gold (Goldwyn 
LO ao ace)e 

But for balance admire the growth of a slightly more pedestrian 
vintage, a 1959 Beaulieu estate-bottled Cabernet. Today a case of it 
may be purchased for $1,200. For the more modest budget, $950 will 
buy a case of 1970 Beaulieu George Latour Private Reserve Cabernet. 
All this goes to show that wine, the right wine, in the last 10 years has 
provided the prudently daring investor with a spectacular return. 

The virtues of wine as a liquid asset have been tested recently by 
Krasker (1979). Krasker cannot reject a hypothesis that wine offers no 
extraordinary rate of return; the wine investor can expect no greater 
return than that which accrues to riskless assets. While Krasker finds 
the concept that an investment in wine outperforms Treasury bills in 
nominal return “incompatible with the way economists believe most 
assets are priced,” his result seems counter to intuition. Wine as a 
subcategory of riskless assets seems contrary to wine’s very nature. 

Wine is a living substance. Its maturation process extends well 
beyond the bottling day, lasting as long as 20-40 years in the case of 
some reds, while a few others refuse to retire at 65. But the proper age 
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at which to consume a wine is not some known date but rather a peak 
which must be discovered by the bulk purchaser through ex- 
perimentation. When frequently posed with the question when to 
drink, Harry Waugh, the famous wine writer, would respond, “Just 
imagine looking around a room full of people and somebody saying 
to you, “Tell me at what date each of these individuals is going to 
die’” (Waugh 1968, p. 1). 

This element of uncertainty is reflected in the development of the 
1966 vintage French burgundies. These wines were considered years 
away from full ripeness when, in 1975, out went the cry, “Drink your 
1966’s!”; they had unexpectedly gone over the hill (Robards 1976, p. 
33). 

Such events explain some of the sudden downturns in wine prices. 
Figure 1 traces both the expected and actual maturation of a 
hypothetical vintage and its price through time. If, at some point in 
the life of the wine, investors discover that it has peaked prematurely, 
then they readjust their offer price to reflect this new information. On 
the other hand, a publicized tasting which revealed much life in a 
wine of old vintage could be expected to raise its value to the investor. 

Quality uncertainty need not, by itself, imply that wine will com- 
mand a risk premium unless the variance in the return to wine is large 
relative to the total variance of a portfolio. If wine speculators do not 
hold a “market” portfolio, such a possibility may exist. 

In addition to unanticipated quality changes, wine is affected by 
cyclical demand fluctuations which can severely affect wine prices. For 
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example, the 1974-75 recession coincided with a decline in wine 
prices. This covariance with other asset returns is an additional reason 
that wine will command a risk premium. 

In view of the uncertainty associated with wine’s growth and devel- 
opment and of wine’s covariance with other assets in a potential 
portfolio, Krasker’s empirical finding that investors receive no risk 
premium for holding wine is surprising. This paper suggests an 
alternate method for estimating the wine investor’s return. 

My approach differs from Krasker’s in three important respects. 
Krasker’s data on wine prices are taken from the annual Heublein 
Wine Auction and cover the 4 years 1973-74, 1974-75, 1975-76, and 
1976-77. By limiting his data to these 4 years in particular, he has 
biased his procedure toward finding a subnormal rate of return. ‘Two 
years of his sample of 4 were extremely poor from the perspective of 
the wine industry; in early 1974 and through 1975, newspaper head- 
lines proclaimed that “rivers of surplus wine” existed, reflecting the 
high inventories of wineries and their distributors. In these 2 years, 
wine prices, the French especially, fell precipitously below those prices 
of many preceding years. The negative trend of these figures could be 
expected to impact heavily on wine returns calculated during this 
period. My data include the prices determined by the Heublein Auc- 
tion over 8 years, beginning in 1969 and ending with the 1977 auc- 
tion. Observations covering 4 more years may offset the dampening 
effect of the 1974-75 and 1975-76 auctions on the total return to 
wine storage. 

Second, I eliminate an unknown in Krasker’s regression: the cost of 
storing wine. Where his regression estimate of the cost to the wine 
investor is $16.80 per case per year, I use the cost accounting figure of 
Freemark Abbey Winery, an annual per case storage cost of $.449. 
This number is calculated on the basis of the opportunity cost of the 
storage facility divided by the number of cases. The Freemark Abbey 
storage cost figure should reflect the cost for the auction participants 
who, in general, store wine in commercial cellars. 

For comparison, however, I estimate the annual cellar costs and the 
rate of return to wine storage; in this way I have a basis for comparing 
Krasker’s and my results. 

Finally, my estimation procedure will differ from Krasker’s gener- 
alized least-squares process. Limitations on available computer soft- 
ware forced me to employ an alternative estimation procedure which 
I explain below. 


The Model 


Following Krasker, I restrict my attention to include only red Bor- 
deaux and California Cabernet Sauvignon produced since 1950. 
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Thus, I concentrate on those wines which are commonly found to 
benefit by extended aging, and I exclude those wines which Krasker 
believed were traded as antiques.! An observation is recorded every 
time a particular vintage wine is traded in consecutive years; then over 
the year ¢, t + 1, the rate of return to wine i may be represented by 
(Puitsi — Pu)/Pu, where py is the price of winei | year prior to the price 
observation on the same wine, pj4+4:- 

Krasker tests the hypothesis that the expected rate of return to 
storing wine is equal to the rate of return to riskless assets; that is, 


seas OLE, op = 0: (1) 
Pti 


where @ is the risk premium, 6 is the cost of wine storage, and r 
denotes the average rate of return to 3-month Treasury bills for the 
four quarters which precede the auction. Equation (1) implies the 
following regression equation: 


Pri a Pui —r,= 6+8 1 a Upsii ' (2) 
Pri Pri Pi 
where the error term u;,,; is serially uncorrelated as expectations are 
assumed to be rational, and u;,,;/py is assumed to have mean zero and 
constant variance. 
Because the disturbances are correlated within each year, Krasker 
employed a GLS procedure to derive the estimated equation (stan- 
dard errors in parentheses): 





cS ns een ese (3) 
Pii (037) (.72) Pe Pei 


R? = .03;SE='.24. 


The estimate implies that wine costs $1.40 per bottle ($16.80 per 
case) to store. Although not significantly different from zero, the 
return differential between wine and Treasury bills is negative. 


The Fixed-Effects Model 


Because the sample data involve different numbers of observations 
for each different year, the GLS approach is not straightforward. 
Krasker does not mention how he estimates the off-diagonal compo- 
nent of the variance-covariance matrix for his GLS estimator, and 


1 Although prestige and publicity over bottle content assuredly guided the purchaser 
of the 1806 Chateau Lafite, many truly fine wines have been excluded by the arbitrary 
choice of 1950 for a cutoff point. Nonetheless, I strive to maintain the similarities of 
these two papers where I can, so I have adopted this same breakpoint. 
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instead of writing my own seemingly unrelated regression program, I 
shall consider an alternate method to account for disturbances which 
are correlated within each year: least-squares estimation with dummy 
variables. The dummy variable technique exploits the special feature 
of the combined cross-section/time-series data by explicitly accounting 
for the “constant” portion of each error term for each given cross 
section. 

As in the GLS procedure, this method presumes that each residual 
Uz+1i/Px for observations in a given year may be decomposed into two 
independent components: an individual effect, v4, and a remainder, 
€,. Then, u+1i/Pui = € + Va, where, in terms of the present example, €, 
represents a nonrandom addition to the rate of return to all wine sold 
in the year ¢, and v, becomes the specific classical error term associ- 
ated with the zth wine, that is, 


Evoh; = o2, fort =t' and i =F 
0, fort #t' ori #7. 


The e, component of the rate of return can be incorporated into 
Krasker’s regression equation by the introduction of dummy variables 
to represent given years. In this way, each year is characterized by its 
own special intercept. The regression equation is then a fixed-effects 
model, using Scheffé’s (1959) terminology. 

For ease of exposition, I will denote the dependent variable as y¥. 
The vector y, (k = 1, ... , m) will contain all observations on the 
dependent variables for year k. The variable 1/p, will be denoted z,, 
and z,, will be the vector of observations on z, for year k. The vector v;, 
(k = 1,..., m) will contain all of the observations in v, for year k. 
Finally, i, (k = 2,...,m) represents a vector with entries of one in the 
elements associated with the kth year and zeroes elsewhere. Stacking 
the vectors, let 


Yi Zy Vi 
y Sa: ,Z=]: , andv= 


Ym Zm Vin 


The relationship to be estimated from the combination of time- 
series and cross-section data can then be expressed in the form 


Vi ='Ch Bolo ee le Oy nv, (4) 


where (¢, Bz, B3, - . - , Bm, 5) is a vector of unknown parameters. 
Equation (4) can be estimated efficiently with ordinary least squares. 
The parameter estimates can be used to estimate the quantity (¢ + 
[1/m] Xj2, B;); the risk premium that one would expect to accrue to a 
wine is randomly selected from the total sample. 

The application of this procedure to the wine data indeed reveals a 
premium to the wine investor with one qualification. Running four 
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TABEEN 


THE RETURN TO WINE STORAGE 














REGRESSIONS USING REGRESSIONS USING 
4-YEAR Data (N = 140) 8-YEAR Data (N = 199) 
Regression Storage Cost Regression Storage Cost 
Estimate of Estimate Estimate of Estimate 
Storage Cost $.449 Storage Cost $.449 
Average premium 
to wine investor —.01367 -03812 .08542 .16614 
Standard error 
(average premium) .03399 .0661 .03379 .03296 
t-statistic’ —.4021 5767 2.5279 5.0392 
6 17.25 ASP 16.60 Be: 
(SE) (8.76) (5.13) 
R?2 .29 Bo Al 38 
Standard error .2415 .2439 .2532 .2587 
Mean dependent 
variable .0182 .0168 .1036 .1068 





separate regressions, I have distinguished between the effects on the 
rate of return of (1) increasing the period of observations from 4 to 8 
years and (2) estimating (on the right-hand side) versus incorporating 
(on the left-hand side) the storage cost in the regression equation. 
Specifically, I run two regressions on equation (4), one using Krasker’s 
140 observations taken over 4 years and the second using 199 obser- 
vations from an 8-year time span. In an identical fashion, I run two 
more regressions on the following version of equation (4), 


Wi - 102 Cit pal it Cala V, (5) 
where 6 = .449, the Freemark Abbey storage cost estimate. Using the 
estimates for (¢, Bo, . . - ; Bm), I have calculated é + (1/m) Zit, B; 


together with its standard error. The results appear in table 1. 

Not surprisingly, one estimate confirms Krasker’s findings that the 
expected return to wine lies beneath the return to Treasury bills. This 
regression utilized Krasker’s data over the period 1973-77, where 
1973-75 were very poor performance years for the wine industry. 
And like Krasker’s result, the return is biased downward by the 
unrealistic storage cost estimate produced by the regression: an an- 
nual $17.25 per case. The influence of this estimate can be seen in 
the second regression, where the cellar cost is accounted for using 
the Freemark Abbey winery estimate; a 3.8 percent premium to 
oenophiles separates the wine and Treasury investment. However, as 
in Krasker’s study, neither premium is significantly different from 
zero. 

Expanding the data to cover the 8-year span (1969-77) greatly 
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improves the calculated return to wine. A low dividend of 8.5 percent 
reflects the high estimated cellar cost of $16.60. But wine outperforms 
Treasury bills by an impressive 16.6 percent for the commercial 
establishment for whom the Freemark Abbey cost gauge applies. Both 
premiums are significantly different from zero. 


Reinterpreting Krasker’s Regression Equation 


A somewhat disturbing question which I have yet to address is why 
Krasker’s and my regression equation with unrestricted storage costs, 
that is 





2) 


t+1i — Pti —r,=0+58 Feit 


pti Pri Pri 


give the implausible estimate, attributed to 1/p,, of $16.60 per case 
annually. However, note that 16.60/p, is a quantity on the order of 
magnitude of a rate of return, since the average value of py over the 
8-year period is $410. An alternative interpretation of this result is 
that 8(1/p,,) does not represent annual storage costs but rather an as 
yet unexplained portion of the rate of return to wine storage. 

If the premium of the wine investor can be segregated into two 
effects, one which accrues to the wine regardless of price and a second 
effect related to its nominal price, then 6 + (8/p,) will represent the 
total return to storing wine. In this case 6/p,, captures the premium as 
a function of price. Krasker’s positive estimate for 6, interpreted in 
this light, implies a higher premium to lower-priced wines. The alter- 
native postulate that high-priced wines receive the better rate of 
return would be supported by a negative 6. 

The possibly surprising conclusion that lower-priced wines receive 
higher premiums than their counterparts can be explained by risk. 
Wines attached to smaller nominal price tags for some reason entail 
greater risk and return for the investor. Thus price stands as a proxy 
for risk, where risk can be measured by the variance of the rate of 
return. The validity of this explanation for Krasker’s “cost” estimate 
relies on the variance of the rate of return of high- and low-priced 
wines, and support for this interpretation of the regression equation 
would be gained by a measurably higher rate of return variance in the 
low-price group. Ordering the 199 observations from the 8 auction 
years by nominal price, I have computed the mean rate of return and 
variance for the 65 highest- and 66 lowest-priced wines. While wines 
in the high price range offered the investor an average rate of return 
of .03 (3 percent) with a variance of .051, the lowest one-third 
provided a mean rate of return of .3587 (36 percent) with a variance 
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more than twice as large as the highest third, .1162. Performing an F- 
test, I can reject the hypothesis that the two variances are equal at the 
95 percent level of significance, since the value of the F- statistic is 2.28. 

Repeating the procedure using 140 observations from the 4 auction 
years considered by Krasker yields similar results. The 47 highest- 
priced wines provide a mean return of —.0267, with a variance of 
.044; the 47 lowest-range wines earned an average return of .1843 (18 
percent) with a variance of .086, double that of the high-priced wines. 
Again, based on an F- statistic of 1.95, I can reject the hypothesis that 
the two variances are equal at the 95 percent level of significance. 

Apparently, then, wine has an even higher return than allowed 
earlier in this paper, since previous calculations did not incorporate 
the return associated with a wine’s own price but rather attributed 
that portion of the rate of return to cost. Using the cellar cost statistic 
computed by Freemark Abbey Winery, we can infer from Krasker’s 
regression that 





Puna Pao es bop! 1 Ut+ii 
a ry — 449 - (0 as Seca we 
The amended risk premium to storing wine for the 8-year period, 
derived by computing é + (1/m) 3% B + (8/p), with p = $410, is .1239 
with a standard error of .0672. 

Before proposing an intuitive explanation for the greater risk asso- 
ciated with lower-priced wines, it should be emphasized that “lower 
priced” is relative to this sample of expensive premium wines. By no 
means do I mean to imply that risk is associated with less costly wines 
in general; consistency (i.e., zero risk) is the by-line of Ernest and Julio 
Gallo’s jug wines. 

One plausible interpretation of the result is that the most expensive 
wines, heavily represented by the older vintages, offer greater cer- 
tainty to the investor because of their age. Their more youthful 
counterpart, with a shorter track record, provides the wine buyer with 
unrealized potential, and thus risk, at a discounted price. 


Conclusion 


In reestimating the rate of return premium to wine storage using a 
larger data set, I have reversed Krasker’s findings that wine specula- 
tion yields no greater return than do ‘Treasury bills. In addition, I 
have shown that the use of an actual storage cost measurement allows 
reinterpretation of Krasker’s estimated “storage cost” as part of the 
return to wine. Adding this return to Krasker’s original estimates also 
produces a positive premium to wine storage. 
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On the Relationship between Commodity Price 
Changes and Factor Owners’ Real Positions 


James Cassing 


Australian National University 


Recent contributions to trade theory have utilized the generalized 
variable proportions model in order to identify “natural friends” and 
“natural enemies” of particular relative commodity-price changes 
(see, e.g., Jones and Scheinkman [1977] and Jones [1979, chap. 8] for 
a relevant survey). In the case where the number of factors equals 
the number of commodities, it is now known that every factor has a 
natural enemy in some commodity in the sense that any factor’s reward 
is inversely related to some commodity’s price. Also, it is known that, 
in general, a given factor need not have a natural friend in the sense 
that the factor’s reward may not increase more than proportionately 
to any commodity’s price increase. 

However, this result should not be interpreted to imply that in the 
n X n case some individual whose endowment consists only of a given 
factor may not be able to increase his real income through an appro- 
priate commodity-price rise. On the contrary, when we take into 
account an individual’s expenditure pattern, it can be shown that any 
individual can improve his real position through some commodity- 
price rise. 

In order to see this, consider the standard n X n variable propor- 
tions trade model. Let 8 denote the n X n distributive share matrix, w 
the n xX 1 vector of factor rewards, and p the n X 1 vector of 
commodity prices. Using “*” to denote a proportionate change, we 
have the usual price equation 


A 


Oa =p. (1) 
This paper has benefited from the helpful comments of Peter Warr and an anony- 
mous referee. 
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We assume that 9’ is nonsingular so that 
w= 0’. (2) 


Jones and Scheinkman show with an example that for any factor the 
ith row of @’-! may contain elements all less than unity. 

But the change in an individual's real position also depends upon 
expenditure patterns. Suppose that agent /’s endowment consists only 
of the ith factor. Denote by $i the /th agent's expenditure share of 
income on commodity k. Then the real income effect for individual / 
of a change in any commodity k’s price is given by 


thi — bine (3) 
Substitution from (2) yields the real income effect 
(On? — b',)Prs (4) 


where 0};'! is the i — kth element of 90’"?. 

The real position of agent / improves if the term in parentheses in 
equation (4) is positive. We know that this term is negative for some 
k, since all factors have natural enemies. But the sum of these 
coefficients over all k is zero, since 8’ is row stochastic, so that 9’! has 
unit row sums, and since all income is spent. That is, 


n n 


SO oer 1 (5) 
k 


k=1 — 


Therefore, since the summation has at least one negative term, it must 
have a positive term as well. The real position of any individual can be 
improved through an appropriate commodity-price rise. 

As it stands, this result is not the exact counterpart of the “existence 
of a natural enemy” result which depends on only the technology. A 
natural friend would be acommodity for which a price rise induces an 
improvement in a factor’s real wage, independent of the expenditure 
shares of its different owners used to compute each owner’s price 
deflator. Rather, we show here only that in the n X n case for each 
individual / whose endowment consists only of a given factor, there 
exists a commodity such that, when this commodity price rises, the 
real wage of that factor, when deflated by /’s price index, rises. Thus 
every individual is a proponent of some—not necessarily the same— 
commodity-price increase. And, of course, if all owners of a single 
factor happen to display the same expenditure shares, then there 
would be a sense in which every factor can improve its real position 
through some commodity-price rise. 
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Stochastic Implications of the 
Life Cycle-Permanent Income Hypothesis: 
Evidence for the U.K. Economy 


Vince Daly and George Hadjimatheou 


Kingston Polytechnic 


In a previous issue of this Jowrnal, Hall (1978) presented a simple life 
cycle-permanent income model of the consumption decision ac- 
cording to which consumption expenditures are predicted to follow a 
random walk with trend; that is, 


Gp = AG Ei ez Neel, (1) 


where ¢; is white noise. Hall’s stated purpose in deriving this equation 
was to provide a testable implication of the pure life cycle-permanent 
income hypothesis which avoided the econometric problems associ- 
ated with a consumption function in which measured income is in- 
cluded as an explanatory variable. The testable implication is that, 
apart from this period’s consumption expenditure, no variable 
observable in this or in earlier periods should show any predictive 
power for next period’s consumption expenditure. For U.S. quarterly 
data 1948(1)—1977(1), Hall found this to be the case, with the excep- 
tion of movements in the real value of corporate stock.' Given the 
importance of the hypothesis it would seem worthwhile to examine 
the extent to which it is substantiated by alternative economic his- 
tories, and so we present here the results obtained from U.K. quar- 
terly data in the postwar period. 


‘In Hall’s paper the significance of the corporate stock variable is not considered as 
contradictory to the tested hypothesis when the model is modified to allow for a lagged 
adjustment of consumption to a change in permanent income. The implicit assumption 
is that, given the random walk nature of corporate stock values, changes in them are 
expected to be correlated with changes in permanent income. 
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The data are taken from Economic Trends—Annual Supplement 1980, 
and, following Hall's example, variables were, where appropriate, 
measured per capita and deflated by the implicit price deflator of 
nondurable consumption expenditure, the latter being the measure 
of consumption adopted in the original study. Since the test proce- 
dure involved searching for variables which could supplement the 
predictive power of equation (1), a large number of regression results 
were generated which we do not report below; we simply note that 
unemployment, the inflation rate, and the rate of interest, all of which 
have been proposed on occasion as determinants of C,, did not con- 
tribute significantly to the explanatory power of a regression equa- 
tion containing C;,_,. 

Our substantive result is that regression specifications which in- 
clude lagged values of disposable income or liquid assets, or the 
period-to-period changes in these variables, and regressions which 
include consumption expenditures lagged by more than one period 
can significantly outperform the simpler specification in which con- 
sumption expenditure is predicted only by its own lagged value.? 
Thus some doubt is cast on both the validity of the pure life cycle- 
permanent income hypothesis, in the case of the United Kingdom, 
and the policy prescriptions that might follow from an acceptance of 
the hypothesis. 

It can be seen in the Appendix that both consumption expenditure 
lagged more than once and lagged income appear with coefficients 
which are significantly different from zero. Furthermore, for the 
shorter period 1964—78 for which data on liquid assets are available, 
changes in the latter in the recent past seem to contribute significantly 
to the explanatory power of the regression. In all cases the size of the 
F ,-statistic suggests that the hypothesis that the additional variables do 
not contribute to the regression should be rejected. 

Finally, repeated estimation for different subperiods of the simple 
regression of C, on C;_, indicates that the coefficient on the latter 


* The relevance of liquid assets in explaining consumers’ expenditure has been 
suggested in a number of recent studies; see, e.g., Townend 1976. 

* When 10 observations of each sample are retained for a postsample parameter 
stability test of the alternative specifications, the x? (Davidson et al. 1978, p. 674) and 
Chow’s F-test suggest thai in all cases the hypotheses of postsample parameter stability 
and of no structural change should not be rejected. What is more important, however, 
for the present study is that the specification which allows for the influence of changes 
in liquid assets seems to provide more accutate forecasts than the pure life cycle— 
permanent income hypothesis. The superiority of the alternative specification is also 
maintained when stability is tested for post-1972 data—a critical period for economet- 
ric forecasting, coinciding with a dramatic increase in the U.K. saving ratio (Had- 
jimatheou 1979). For this period, however, the value of x? suggests that the hypothesis 
of postsample parameter stability should be rejected for both specifications while the 
F-value is still not statistically significant. 
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variable is unstable. More specifically, as can be seen from the follow- 
ing values of X, the greater the time span of the sample, moving 
backward toward the less recent past, the higher the level of the 
coefficient: (i) 1973(1)-1978(4), 1.00125; (ii) 1964(1)—1978(4), 
1.00374; (iii) 1956(1)-1978(4), 1.00413; (av) 1964(1)—1972(4), 
1.00483; (v) 1956(1)-1972(4), 1.00497 (Chow test: ii and v, F (24,66) 
= 3.14; ii and iv, F (24,34) = 2.21). In the context of a more con- 
ventional consumption function this is not surprising, given the 
marked downward trend of the average propensity to consume in the 
United Kingdom, especially in the 1970s. This observation, however, 
does not seem to lend support to Hall’s statement that constancy of X 
is expected to “be a good approximation, at least over a decade or 
two” (Hall 1978, p. 975). 

In conclusion, the present evidence seems to suggest rejection of 
the pure life cycle-permanent income hypothesis in the U.K. case. 
Furthermore, the latter cannot explain the revealed differences be- 
tween the two countries. It may be tempting to suggest that English 
people are less “rational” than Americans, but such a pronouncement 
would certainly need an explanation. 

What is probably needed is a more general theory of the consump- 
tion decision which encompasses the different experience of the two 
countries as special cases. It may be that binding financial constraints, 
changes in the age structure and the distribution of income, as well as 
changes in interest rates and the composition of wealth, are too 
important to be ignored. The importance of some of these factors has, 
after all, been recognized, either explicitly or implicitly, by the life 
cycle and/or permanent income hypotheses. 


Appendix 
Dependent Variable is C 


1956(1)-1978(4), N = 91: 


1OOLISC ae (Al) 
(1055) 
R?= 0.994, S=2.108, SSR = 398.2. 


01819C{)-+'0.9197C,., +10: 12012) — 0. 1409Ven 
ES) (2.1) (2.9) (3.6) (A1.1) 
R2=0.995, S=1.986, SSR=343.1,  F,(3,87) = 4.66. 
1964(1)—1978(4), N = 59: 
1.00374C,_, 
(780) (A2) 
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R2 = 0.971, S = 2.441, SSR = 346.0. 


OA971C.2, + 0.2353C,., + 0.273C,_, 
(4.1) (1.67) (2.3) (A2.1) 
+ 0.0895AY,_, + 0.115ALA,_,, 
. (1.92) (5.2) 
R* = 0.9884, S= 1.881, SSR=191.0, F,(4,54) = 10.96. 


0.604C,_, + 0.442C,_. + 0.0831Y,, 


(4.8) (3.6) (1.60) 
— 0.105Y;-» + 0.0886LA,_, — 0.0926LA,_», (A2.2) 
(1.79) (3.8) (4.0) 


R? = 0.988, S= 1.96, SSR = 204, F,(5,53) = 7.38. 


Numbers in parentheses are f-statistics; S = standard error of estimate, SSR = 
sum of the squares of the residuals, and F, is a test for the significance of 
additional parameters; AY,_, = ¥;-, — Y;-2 and ALA,_, = LA;_, — LA;_-»; LA = 
selected liquid assets of the personal sector as given in Financial Statistics. 
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Miscellany 





Why There Are No Risk Preferrers 


Somewhat more than 100 years ago, Smith’s celebrated diamond- 
water paradox was solved by the observation that however valuable 
water might be to someone who had none, if it was produced and sold 
in unlimited quantities at a price p consumers would adjust their rate 
of consumption so that its marginal value to them was at most p. The 
same argument implies that behavior under uncertainty depends not 
merely on the actor’s utility function but also on the cost of producing 
(and absorbing) risk. However fond I may be of uncertainty, I will not 
buy it at a high price (by choosing a profession whose returns have 
high variance but low expected value) when I can get it at a low one (in 
Las Vegas). 

Risk can be produced inexpensively; existing sellers in Las Vegas 
and elsewhere charge only a few percent of the expected value of the 
gamble, part of which must be regarded as payment not for the risk 
itself but for the palatial surroundings in which it is produced. Hence, 
absent special circumstances (such as a government monopoly of risk 
production), we may expect individuals in equilibrium to exhibit at 
most a small degree of risk preference, for the same reason that they 
exhibit at most a small marginal utility for water. 

Insurance, unlike risk, is inherently costly to produce because of the 
problems of moral hazard. Moral hazard is a major cost of insurance 
but a minor cost of gambling because a gambling game, being created 
for the purpose of producing risk, can be designed to almost elimi- 
nate the ability of participants to spend resources on influencing the 
outcome. The seller of insurance faces a preexisting situation in which 
it is likely that the buyer (of, say, unemployment insurance) can and 
will affect the odds by his actions. Hence insurance is costly to pro- 
duce and individuals in equilibrium may exhibit a substantial degree 
of risk avoidance. 


David FRIEDMAN 
Virgima Polytechnic Institute 
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The Regulatory Process and Labor Earnings. By RONALD G. EHRENBERG. 
New York: Academic Press, 1979. Pp. xx+204. $15.00. 


The irony is that this study was not published in the Bell Journal of Economics. 
Ehrenberg’s argument is that the wage rates of workers in the New York 
Telephone Company are above those of other equally qualified workers and 
that therefore the New York Public Service Commission should not allow 
further wage increases to be passed on as higher telephone rates. As I read 
through his argument I wondered how Ehrenberg would have treated a 
request to cover increased publication costs by the editors of the Bell Journal. 
A related tactic did not escape notice by the leaders of the Communication 
Workers of America (CWA), however, who were on the other side of the 
regulatory proceeding in which Ehrenberg participated. In addition to im- 
plying that Ehrenberg was a mere hired gun for Alfred Kahn (then chairman 
of the New York State Public Service Commission), according to Ehrenberg, 
“. .. political pressure was put on my employer, the New York State School of 
Industrial and Labor Relations (ILR) at Cornell University, by the New York 
State AFL-CIO, which cancelled a scheduled conference at Cornell, implicitly 
suggesting that it would be wise if ILR faculty did not get involved in similar 
cases in the future” (pp. 147-48). The hero of this part of the story was 
apparently Bob McKersie (then dean of the ILR school but now at MIT) who 
refused to take institutional responsibility for an individual faculty member’s 
research conclusions. 

I think most economists will find this a fascinating study. In fact, I would 
urge any economist looking for a good read while on holiday to take this 
(relatively inexpensive) book along. I took it and a pretty good paperback tor 
a week on Martha’s Vineyard (spending a whole summer there is apparently a 
prerequisite for becoming president of the AEA) and actually finished 
Ehrenberg’s book first. For one thing, it is well written. Regression equations 
and the like are kept in an appendix. As a consequence the book would make 
excellent supplementary reading for graduate or undergraduate courses in 
labor economics, industrial organization, public policy, or applied economet- 
rics. 

For another thing, the book tells a coherent story about a complete policy- 
oriented research project from start to finish. The 1976 inception of Ehren- 
berg’s study resulted from an unprecedented request by the New York 
Telephone Company for a rate increase to cover future, yet unnegotiated, 
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increases in wages and benefits. The conclusion of the study comes a year 
later when the Public Service Commission declines to accept Ehrenberg’s logic 
and therefore finds his empirical calculations inappropriate to the decision at 
hand. In the meantime, however, the union-management negotiations had 
been completed under the shadow of a possibly adverse decision on cost 
pass-through, and, I would bet, this was just the purpose for which Ehren- 
berg’s study was initiated by the Public Service Commission staff in the first 
place. 

Ehrenberg’s study is also coherent in another dimension. It reflects a 
remarkable breadth of substantive argument over a range of subjects cover- 
ing labor law, the determinants of unionization and bargaining structure, as 
well as some very convincing empirical economics. For labor economists there 
are some special treats. Ehrenberg tells us that he billed the Public Service 
Commission staff for 40 days of consulting time (at an undisclosed rate), while 
National Economic Research Associates billed New York Telephone for 87 
days of consulting time (at an undisclosed, but higher, rate). | am not sure 
what this is supposed to imply for public policy, but it does raise a larger issue. 
As the real academic incomes of economists have fallen throughout the last 
decade (I recently calculated that junior faculty taking jobs at my university 
this year will receive real salaries 22 percent below the starting salaries 12 
years ago), their efforts have naturally turned toward the more practical and 
remunerative. One cannot help wondering what effect this trend will have on 
the profession’s research output as this income squeeze intensifies during the 
decade of falling college enrollments that we are entering. 

The core of Ehrenberg’s study is his empirical comparison of wage rates 
and fringe benefits at New York Telephone with the wage rates of other 
workers. Ehrenberg deliberately chooses “other workers” to be other nonun- 
ion workers with similar skills, a decision the Public Service Commission 
ultimately found inappropriate. Wage comparisons are first made using sev- 
eral surveys from the Bureau of Labor Statistics and other sources. Since 
worker human capital is difficult to control in these comparisons, standard 
wage regressions from the 1/100 sample of the 1970 Census data for New 
York State are also computed. The telephone company is so large an em- 
ployer, and New York so large a state, that this procedure produces relatively 
precise wage-differential estimates. Taken together, I found Ehrenberg’s 
empirical case convincing, especially when bolstered with the extraordinarily 
low relative quit rates at New York Telephone. In sum, I think there is little 
doubt that wage rates at New York Telephone had by 1976 been increased 
above market-clearing levels and that many of the employees of this company 
were earning economic rents. The question that naturally arises, therefore, is 
what attitude the New York Public Service Commission should have taken 
toward this stylized fact on the occasion of a rate-increase request. 

The administrative law judge in this regulatory proceeding recommended 
that this “fact” be ignored because to do otherwise would be to interfere in the 
collective-bargaining process in violation of federal labor law. Thus, the ad- 
ministrative law judge concluded that whether to use Ehrenberg’s wage com- 
parisons was mainly a “legal issue.” As anyone who has consulted with lawyers 
will understand, a “legal issue” is primarily one for which there is no accepted 
answer. In fact, Ehrenberg shows that as a matter of law the administrative 
law judge was simply incorrect on this point and the Public Service Commis- 
sion concluded similarly. Despite the commission’s conclusion that disallowing 
excessive labor-cost increases was legally permissible, however, the commis- 
sion explicitly rejected Ehrenberg’s wage comparisons as inappropriate. 
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In effect, Ehrenberg compared the average wage rate in a unionized 
monopoly firm against the average of all other wage rates, union and non- 
union. Since the majority of all workers are nonunion, this is primarily a 
comparison of the wage rates of unionized workers in a monopoly firm 
against the wage rates of nonunion workers. The commission argued that the 
average wage rate of the workers at New York Telephone should have been 
compared against the average wage rate of other unionized workers. Since 
there is an empirically established overall union/nonunion wage differential, 
this comparison would essentially have uncovered whether the union/ 
nonunion wage differential was systematically greater in a particular 
regulated-monopoly industry. If this had been the case, the commission 
apparently would have been prepared to disallow further labor-cost increases 
as within the scope of its mandate to regulate monopoly power but not union 
power. 

Ehrenberg really offers no convincing argument against the commission’s 
position. Moreover, although he does not offer the necessary empirical work 
to test the appropriate factual issues, I suspect that such a study would not 
demonstrate that union/nonunion wage differentials were higher among 
unionized employees of monopoly firms than among other unionized work- 
ers. Historically, for example, union/nonunion wage differentials have been 
highest among construction workers and mine workers, both relatively com- 
petitive industries. 

The difficulty here is that the commission’s position essentially takes the 
presence of unionization as exogenous and asks merely whether the presence 
of regulated monopoly exacerbates union wage pressures. This position ig- 
nores the really important question of whether an industry might not have 
become unionized because it is a regulated monopoly. In this case the presence 
of the union is endogenous to the regulatory process, and the appropriate 
wage comparison is akin to the one Ehrenberg offers. The anecdotal evidence 
he presents of the evolution of the Bell System bargaining structure as well as 
the well-known anecdotal evidence regarding the trucking and railroad in- 
dustries and the history of the Davis-Bacon Act in contract construction all 
support this interpretation. As Ehrenberg’s experience clearly demonstrates, 
however, making this case before a regulatory commission 1s likely to be a 
thankless task, and perhaps not very remunerative either. 


ORLEY ASHENFELTER 
Princeton University 


Law, Legislation and Liberty: A New Statement of the Liberal Principles of Justice and 
Political Economy. Vol. 3: The Political Order of a Free People. By FRiEDRICH A. 
HAYEK. 

Chicago: University of Chicago Press, 1979. Pp. 243. $14.00; London and 
Henley: Routledge & Kegan Paul, 1979. Pp. 243. £5.95. 


But to avoid this [destroying our civilization] we must shed the 
illusion that we can deliberately create the future of mankind. 
This is the final conclusion of the forty years which I have 
devoted. ... 


I want to thank Kaj Areskoug, Fritz Machlup, Ingo Walter, and Lee Wohlfert for 
comments on an earlier draft of this review. 


604 JOURNAL OF POLITICAL ECONOMY 


These dramatic words end the third and last volume of Law, Legislation and 
Liberty: A New Statement of the Liberal Principles of Justice and Political Economy 
(hereafter cited as LLL) by Nobel laureate Friedrich A. Hayek. The recently 
published volume, The Political Order of a Free People, analyzes the undermin- 
ing of democratic principles and liberty in parliamentary democracies and 
concludes with an outline of a new constitutional framework. 

The message of Hayek’s voluminous work since World War II" is, to put it 
briefly, that the current political system of Western democracies is bound to 
cause increasing coercion of individuals by governments, private groups, and 
firms, and that governmental decisions will increasingly be contrary to the 
opinions of the majority. He argues in volume 3 that the unlimited powers 
(sovereignty) of governments—combined with the belief and expectation that 
governments can direct production, consumption, and the distribution of 
income to maximize social welfare—are the necessary and sufficient condi- 
tions for this development. The constitutional proposal is designed to subject 
the government to substantial constraints in its use of coercive powers. 

The previous two volumes of LLL (Hayek 1973, 19766) provide the foun- 
dation for the analyses of the political process in volume 3, but the last can be 
read independently. Rules and Order analyzes the role of rules and law in a 
“free” society—one in which no individual can be coerced to act in accordance 
with the will of others. Law must represent general principles of just conduct 
on which there is common agreement. Coercion must be used only to enforce 
such general principles applicable to all individuals. The Mirage of Social Justice 
discusses the limits of reason and rationality in social design and the impossi- 
bility of defining social justice. Some arguments in these two volumes will be 
referred to below to clarify the discussion of volume 3. 


The “Bargaining” Democracy and Logrolling 


Hayek uses the concept of the bargaining democracy to describe a system of 
representative assemblies within which different interest groups can trade in 
vote support for each other’s proposals. In the theory of public finance, this is 
called logrolling? and is generally regarded as an improvement in the demo- 
cratic process because the strength of preferences can be allowed to affect the 
outcome. Hayek, on the other hand, argues that the trade in vote support 
contributes to results that do not reflect the opinions of the majority. These 
views are clearly contradictory, yet the abstract nature of Hayek’s analysis of 
the democratic process makes a comparison with conventional public finance 
difficult. However, Hayek’s criticism of trade in vote support within repre- 
sentative assemblies is directed at the wide range of issues on which such 
practices are possible rather than at the principle of logrolling itself. The 
argument is derived from the distinction between votes on general rules of 
just conduct applicable to all citizens (i.e., legislation proper, such as private 
and criminal law) and votes on the distribution of payments and benefits 
among individuals and groups in society. The latter include all votes on taxes 
and transfer payments, allocation decisions with income redistribution impli- 
cations, and “directives” that discriminate in favor of certain groups. The 
representative assemblies in all democracies do in fact practice logrolling on 
both types of issues. It is, therefore, possible that the distribution of benefits 


* This work includes such classics as The Road to Serfdom (1944) and The Constitution of 
Liberty (1960). 
* See, e.g., Tullock 1969. 
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on which a majority decide at the same time violates general rules on which 
the same majority agree. For example, a principle of taxation can be ac- 
cepted while special benefits (e.g., tariffs and subsidies) with implicit positive 
or negative tax equivalents are passed by the same assembly, violating the 
agreed-on principle. It appears that the theory of logrolling is valid for votes 
on the distribution of benefits and payments only when no general agreed-on 
principle is applicable. Only pure collective goods are likely to satisfy this 
condition when a principle of taxation has been laid down, because individu- 
als can be assigned tax equivalents for other publicly provided services and 
benefits. 

The coercive powers of governments to benefit some groups at the expense 
of others, in addition to enforcing generally agreed-on principles, is the 
source of what Hayek calls “the miscarriage of the democratic ideal” (p. 98). 
These powers make it possible and advantageous to trade in specific benefits 
as well as to trade in benefits for support of general principles. Moreover, 
well-organized interests are likely to gain disproportionate weights in the 
democratic process—“there is no limit to the blackmail to which governments 
will be subject” when “no superior judiciary authority can prevent the legis- 
lature from granting privileges to particular groups” (p. 11).? The final 
outcome of the democratic process need not, then, correspond to anybody’s 
opinion on what is right or to the will of the majority. Hayek argues, in fact, 
that this concept can be stated only in terms of general rules and principles.* 
Behavioral incentives of the rules are lost and the will of the majority cannot 
be given contents when representatives decide both on general rules and on 
adjustments to the outcome of individuals’ activities that are performed 
subject to these rules. 


A Model Constitution 


Hayek outlines a constitution designed both to make the outcome of the 
democratic process coincide with principles held by the majority and to 
minimize the degree of coercion in society.’ The proposal is very simple: 
Distinguish between legislation proper and decisions on government spend- 
ing, administration, and regulation by separating the two functions into two 
distinct assemblies elected by entirely different procedures. Make the gov- 
ernment body subject to the general rules of conduct decided upon by the 


3 A suggestion like Friedman and Friedman’s (1980) to legally prohibit the use of 
tariffs for the benefit of special-interest groups can obviously not get to the root of the 
problem of the democracy in Hayek’s view as long as the government has the coercive 
power to use a multitude of alternative means of creating special benefits for the same 
groups. 

+ Note that Hayek regards the state of democracy in the United States as threatened 
as are the parliamentary democracies in Europe because the judiciary branch of the 
American government does generally not subject decisions of Congress to a test against 
general principles—though this may have been the intention of the Founding Fathers. 

° This argument is developed in vol. 2 of LLL. 

6 Fritz Machlup has pointed out to me that one could argue that Hayek’s proposal for 
constitutional reform contradicts his own views on the creation of human institutions. 
While Hayek argues that human institutions cannot be designed rationally and deliber- 
ately for the benefit of mankind, he also argues that they cannot develop spontaneously 
in accordance with people’s preferences unless the conditions for such developments 
exist. The proposal constitutes a deliberate design for the purpose of creating such 


conditions. 
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legislative assembly. Hayek’s model constitution also contains a general decla- 
ration of rights and an important definition of what qualifies as law—a 
general rule of just conduct. A constitutional court would be established to 
test the appropriateness of the legislature’s decisions against such a definition. 

The legislative assembly would be responsible for the body of criminal and 
private law, the principles of taxation, general regulations for safety and 
health, rules to secure competitive markets, corporate law, and the like. The 
coercive powers of governments would be limited to the enforcement of these 
general rules and principles. 

The government assembly would decide on the use of material and human 
resources entrusted to the public sector. The size and the general purposes of 
expenditures would be limited only indirectly by the general rules of conduct 
set down by the legislative assembly and by people’s willingness to pay taxes. 
The general principles of taxation decided upon in the legislative assembly 
would make citizens aware of their share in payments for specific services. 
This would prevent the current practice of disguising tax burdens to “make 
those who will ultimately have to bear it [the burden] as little aware of it as 
possible” (p. 127). 

Would the proposed constitution achieve its purposes? A critical issue 
seems to be whether the legislative assembly can be prevented from instituting 
laws favoring large groups or wealthy interests. Hayek presents a number of 
suggestions in this regard. More important, however, is the definition of “law” 
and the role of the constitutional court in evaluating the constitutionality of 
legislation against this definition. Hayek characterizes (p. 109) a law as nega- 
tive in the sense that it does not “aim at achieving concrete purposes,” it must 
be applicable to an “indefinite number of unknown future instances,” and it 
must exclude “all provisions intended or known to affect principally particu- 
lar identifiable individuals or groups.” This definition would apparently rule 
out legislation covering such issues as minimum wages and price controls. 
However, the boundary between measures to improve information spreading 
in the market and the protection of particular firms or groups is sometimes 
extremely vague, as in the case of legislation on health standards, consumer 
protection, and occupational safety. It is therefore easy to imagine situations 
in which the members of the constitutional court assume a critical role. The 
experience with the U.S. Supreme Court indicates that the existence of a 
constitutional court does not necessarily provide a complete safeguard against 
legislation that favors large groups in society. A simple addition to Hayek’s 
proposal could help achieving his purpose—Wicksell’s old suggestion 
(Wicksell 1896) of qualified majorities and relative unanimity. A principle that 
legislation must be passed by, for example, 90 percent of the assembly hardly 
seems objectionable since decisions on general rules must reflect the stan- 
dards of most citizens to be respected and justify coercion in enforcement. 

Where does Hayek’s proposal lead? Debate on it cannot be carried out 
without reference to more specific results. Since the author provides only 
general and abstract clues, I will here attempt to illustrate some potential 
consequences by relating it to a few economic political issues. 

The inflationary bias of current democracies has been a theme in Hayek’s 
earlier writings and is clearly linked to the subject of this volume since 
inflation is one way for governments to disguise the true tax burden. The 
independence of the central bank from the government seems guaranteed 
under the proposed constitution because directives covering the central bank 
will have to take the form of general rules unrelated to specific political and 
economic circumstances. This provides a check on the monetization of gov- 
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ernment fiscal deficits.’ Inflationary pressures could also be reduced under 
the proposed constitution because the government would be unable to bear 
the costs of unemployment and business failures. Most market interventions 
in the form of subsidies to labor or capital are likely to violate the general 
rules by which the government must abide. This could contribute to higher 
wage and price flexibility.* Furthermore, governments must consider its ex- 
penditures more carefully when the corresponding tax burden for each 
individual has been established by the legislature.® 

A more controversial consequence of the proposed constitution would be 
that developments in production, consumption, and income distribution to 
which a majority object have to be accepted unless a general rule has in fact 
been violated. However, the outcome of economic activities may shed new 
light on a particular principle and lead to a change therein. The extremely 
far-reaching implications of this can be illustrated with the examples of 
immigration laws and equity-oriented policies. 

Immigration restrictions appear to be unconstitutional, as do tariffs or 
import quotas, under Hayek’s proposal. Such policy measures are clearly 
directed at identifiable individuals to achieve concrete purposes—economic 
benefits for national residents and firms. Opening up the Western indus- 
trialized economies to unlimited immigration would lead to a social transfor- 
mation of enormous proportions. Most people would probably find the out- 
come unacceptable. Nevertheless, they would have to accept immigration as 
the “will of the majority” unless a general principle could be found to prevent 
immigration without thwarting other desirable outcomes, such as imports of 
competitive foreign goods. 

Most policies aimed at securing certain levels of incomes for individuals or 
groups also have to be given up by governments that must abide by general 
principles. Discrimination is necessary to guarantee the outcome of economic 
activities pertaining to an individual. This is illustrated in volume 2 of LLL by 
a revealing and sharply satirical analysis of the contents of the United Nations 
Universal Declaration of Human Rights. This declaration contains positive rights 
(to outcomes) as well as traditional civil rights (to equal treatment before the 
law as defined by general rules of conduct). The two kinds of rights are simply 
contradictory. Either governments treat unequal people equally, abiding by 
generally accepted rules, or they treat unequal people unequally to secure 
certain outcomes. The only available policy instrument aimed at securing a 
certain income distribution would be the general tax structure. Thus, a 
negative income tax would have to be substituted for large parts of the social 
security system. 

The examples above suffice to show that the constitutional proposal is far 
from politically feasible in the foreseeable future. There is probably no 
significant political group in any democracy that bases its political targets on 


7 Hayek earlier suggested the denationalization of the right to issue money (Hayek 
1976a). Though a possibility, this suggestion is not part of the proposal here. 

“Social responsibility” must be the major check on wage and cost increases in an 
economy in which the government guarantees employment independent of the wage/ 
cost structure (cf. Wihlborg 1978). 

® Hayek’s proposal can here be compared with the current discussion about a con- 
stitutional amendment prohibiting an unbalanced government budget. Just as tariffs 
cannot reduce the political pressures to discriminate in favor of a particular industry, a 
constitutional amendment to balance the budget does not reduce the political incentive 
to tax via inflation. Thus, the political process can be used, e.g., to manipulate the 
definition of the government budget—rendering the amendment meaningless. 
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general principles rather than on desired outcomes. Moreover, the proposal 
does not address the question of whether any outcome under a certain 
principle is acceptable. For example, is it not necessary to discriminate in order 
to secure survival of an individual who spends the negative income tax 
payments almost on receipt? Should hospital care be denied individuals who 
have chosen not to invest in medical insurance? Hayek has previously sug- 
gested ways of resolving these questions with a minimum of government 
coercion (Hayek 1960). It is not clear, however, that these solutions are 
commensurate with strict adherence to the constitutional principles outlined 
here. Despite such questions, Hayek’s proposal deserves serious consideration 
as the starting point for further work by economists and social philosophers. 
A fundamental constitutional change may indeed be the only route to pre- 
vent further erosion of democratic values and individual liberty. Hayek has 
argued—convincingly, in my view—that the coercive powers given to gov- 
ernments, in the belief that progress can and should be planned and directed, 
are the cause of a self-generating process leading to the destruction of the 
decentralized market economies in which a high degree of individual liberty 
remains feasible. 

Hayek’s constitutional proposal implies quite simply that we give up the 
idea of steering and planning the future direction of social and economic 
activities. Instead, we should ask how we can set up a system of general 
principles under which mankind’s knowledge can ensure progress for the 
maximum benefit of all. In Hayek’s words: ““To pretend to know the desirable 
direction of progress seems to me to be the extreme of hubris. . . . All we can 
do is to create favorable conditions for it [progress] and then hope for the 
best” (p. 169). 


CxLas WIHLBORG 
New York University 
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Studies in the Economics of Search. Edited by SreveN A. LippMAN and Joun J. 
McCaLt. 
Amsterdam: North-Holland Publishing Co., 1979. Pp. 225. $41.50 (cloth). 


As a sequel to their excellent survey of the job search literature, Lippman 
and McCall have organized an impressive collection of new papers that 
materially enrich the subject. The collection includes original contributions by 
nine young authors on topics that range from the estimation of the elasticity 
of unemployment duration with respect to unemployment compensation to a 
theoretical demonstration that searching for a bargain is the real world 
substitute for Walras’s auctioneer. The individual essays are of uniformly 
high quality as are the editing and production of the volume. 

According to the editors, each paper is motivated by a desire either to test 
the empirical implications of the so-called “standard” search model or to 
extend its structure for the purpose of deriving a richer menu of behavioral 
implications. Regarding these motives as noble, I have attempted to point out 
the sense in which and the extent to which each author has been led by them. 
Since the essays are related to the standard model in the inverse order of their 
presentation in the volume, my comments read from back to front. 

The standard model in a labor-market context supposes that an individual 
worker samples wage rates sequentially from an urn of offers with a known 
frequency distribution at a rate of one per period. The worker wishes to 
maximize expected lifetime earnings net of the out-of-pocket and opportunity 
costs of search. As is well known, the optimal strategy is to accept the first 
offer in excess of some reservation wage. The reservation wage equates the 
cost of taking one more sample to the expected gain attributable to another 
sample. Under reasonable conditions, the reservation wage is independent of 
search duration to date. 

That a reduction in the cost of search yields an increase in the reservation 
wage is a straightforward implication. Given the standard model, an increase 
in the reservation wage, in turn, implies an increase in the expected time 
required to find an acceptable wage and an increase in expected earnings. 
Because unemployment compensation must be forgone when an offer is 
accepted, it acts as a subsidy to search in the standard model. 

Kathleen Classen tests for these predicted effects by estimating the 
reduced-form relationship between both unemployment duration and post- 
unemployment earning and unemployment compensation. Both the weekly 
benefit received and the maximum duration of benefits are included as 
right-hand-side variables. The analysis yields a strong positive relationship 
between duration and both measures of unemployment compensation liber- 
ality but no statistically significant relationship between eventual earning and 
liberality. Although the second result can be explained away, it is also true 
that alternative explanations exist for a positive effect on duration. The paper 
has many interesting things to say, but as a test of search theory, the results 
are inconclusive. 

One would prefer to test directly for the implied relationship between the 
reservation wage and the cost of search. A major problem in doing so is the 
nonobservability of the reservation-wage rate. Nick Kiefer and George 
Neumann report on an attempt to get around this problem. Their method 
takes advantage of the structure of the standard model. In the paper, they 
show that data on accepted wage rates and realized unemployment durations 
can be used to estimate simultaneously the structural parameters of both the 
reservation-wage equation and the probability of finding acceptable employ- 
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ment function. Although their estimate of the unemployment benefit effect is 
positive in the first case and negative in the second as expected, the estimated 
effects of maximum benefit duration have the “wrong” signs. In spite of this 
ambiguity, the paper represents a valuable contribution to the methodology 
of estimating sequential, discrete choice models. 

The standard model “explains” unemployment as a rational response to 
wage dispersion. Critics argue that this explanation is of little importance, 
because workers with long unemployment spells, those that account for most 
unemployment, do not turn down offers. They simply cannot find a job. By 
allowing unemployed workers to allocate time between search effort and 
nonmarket activity, Ken Burdett has extended the relevance of the search 
theoretic framework even to the case in which there is no wage dispersion. In 
his model, a reduction in job availability on unemployment is amplified by the 
rational allocation of a worker’s time. 

In the paper presented in the volume, Burdett shows that search intensity 
increases, and the reservation wage falls with unemployment duration, as the 
unemployment-benefit period is exhausted. This theoretical result suggests 
the existence of misspecification bias in both of the empirical papers discussed 
above. The allocation of time among search, work, and leisure by an em- 
ployed worker is also investigated. This case is a natural synthesis of classical 
labor supply and search theory. The principal contribution of the paper is to 
incorporate search theoretic considerations into a more general dynamic 
theory of labor supply. 

The papers by Jeffrey Hall, Steven Lippman, and John McCall and by John 
Danforth study the consequences of relaxing the assumption of risk neu- 
trality. Presuming decreasing absolute risk aversion, the implied relationship 
between the reservation wage and nonhuman wealth is positive. Hence, the 
papers represent a rigorous derivation of the long-standing contention that 
the running down of financial assets explains the observed negative associa- 
tion between the reservation wage and duration of unemployment. As a 
technical matter, Hall et al. show that the reservation-wage property need not 
hold when past offers can be recalled. Nevertheless, the probability per 
period of finding acceptable employment decreases with wealth in this case as 
well. Finally, Danforth’s formulation represents a synthesis of the theory of 
household wealth accumulation and the theory of wage search. As Danforth 
points out, the theory implies a process of increasing wealth dispersion over 
time if the assumption of decreasing absolute risk aversion is empirically 
valid. Hence, the paper is also a contribution to the theory of income dis- 
tribution. 

The standard search model characterizes the optimal strategy of a price- 
taking agent seeking a bargain. The market effect of such behavior is a 
reallocation of goods and services toward demanders who value them most 
highly. One might expect that such a process would eventually equalize 
marginal valuations, that is, converge to a competitive equilibrium. Joseph 
Wharton formalizes this idea in the context of a market for homogeneous 
labor services composed of workers who search on the job for a higher wage 
and employers who adjust wage offers in response to search-induced changes 
in employment. The process is modeled as a Markov chain. When the cost of 
search is “small,” a competitive allocation of workers across firms is the only 
absorbing state. Although the analysis is insightful and the approach devel- 
oped 1s amenable to further development, the employer's wage-setting be- 
havior is not shown to be rational in any sense. Other work on this problem 
suggests that strategic behavior on the part of the price-setting agents can 
thwart convergence to competitive equilibrium. 
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Search-induced convergence to a single price raises the following paradox. 
How can the standard search model represent an important contribution to 
positive economics if the market effect of the behavior explained eliminates 
the price dispersion required to observe the behavior? Roger Kormendi 
provides a resolution of this paradox by formulating a market model in a 
pure-exchange context that exhibits price dispersion in equilibrium. One of 
the original features of his model is that every agent plays a role in the price 
formation process. The demonstration that mutually consistent (Nash) price- 
setting strategies exist which define a market equilibrium is a major contri- 
bution of the paper. Although a detailed discussion of the kinds of conditions 
one needs for equilibrium price dispersion in Kormendi’s model is beyond the 
scope of this review, it is important to note that nonstorability of the two 
commodities traded is implicit. If this assumption were relaxed, Kormendi’s 
equilibrium is only “temporary” in the sense that, once allocations have 
changed in response to trade, the agents have incentives to play the game 
again. One suspects that the repeated game might well converge eventually to 
a competitive single price equilibrium. 

It is curious that the matching of job and worker is not explicitly recognized 
in the standard formulation of the individual's problem of finding acceptable 
employment. Louis Wilde contributes to this deficiency by distinguishing 
between two job characteristics: the wage, observable on initial contact, and an 
attribute that must be experienced on the job. The analysis characterizes an 
individual worker’s acceptance strategy assuming knowledge of the joint 
probability distribution over jobs. The result is a formalization of the “job- 
shopping” hypothesis typically offered as an explanation for high quit rates 
among the young. A useful extension of this model would also recognize the 
employer’s problem of learning about a worker’s productivity. 

In the final paper, Charles Stuart applies the standard search model to an 
entirely different subject, the spatial clustering of sellers of both the same and 
different commodities. Specifically, he shows that market centers are more 
attractive to buyers than spatially isolated sellers, because the cost of search 
per unit is smaller and the perceived heterogeneity of the goods offered is 
larger. These sources of agglomeration economies offset the effects of trans- 
portation costs and competition that would otherwise induce dispersion in the 
location of sales outlets. Although there are many insightful observations on 
the subject, the author does not attempt to derive an equilibrium solution to 
the location problem in which all of these forces balance. This would seem to 
be a fruitful area for further research. 

In sum, there is much of value in the volume both to students of labor 
economics and to those interested in applying search theory in other areas of 
economic inquiry. 


DALE T. MORTENSEN 
Northwestern University 


A Theory of Income Distribution. By HAROLD LYDALL. 
New York: Oxford University Press, 1979. Pp. viit+326. $32.50. 


Professor Lydall, of South African birth, received his higher education at 
Oxford. He has taught primarily in Britain and Australia, worked in interna- 
tional agencies in Geneva, and also visited extensively in the United States. 
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His methodology recalls the Burns and Mitchell pragmatic tradition of quan- 
titative institutionalism. That is to say, Lydall begins by searching out statisti- 
cal regularities in his data and then seeks one or more theories to explain 
them, preferably on more than an ad hoc basis. He looks with some suspicion 
on the opposite procedure of deducing a general theory first and then 
seeking out data to test it. 

Although the present volume is indeed more nearly theoretical and less 
exclusively statistical than Lydall’s well-known Structure of Earnings (1968), its 
title may seem misleading to the conventional economist. I myself expected a 
new “high theory” (Shackle’s phrase), or at least a tasteful ragout of existing 
high theories. But after seven “background” chapters of a doctrinal-historical 
nature (which explain why the author feels it desirable to strike out along new 
paths), Lydall’s independent contribution retreats more than half way to 
empiricism. 

Once inoculated against anticipations of high theory—wage funds, iron 
laws, marginal productivities, widows’ cruses, degrees of monopoly—the 
reader is impressed by Lydall’s statistical generalizations, both in themselves 
and as challenges for integration into the rival high theories of our rival 
conventional wisdoms. 

Lydall finds, for example, the following relation between value added per 
worker, which he calls v, and the average size of the employing firm as 
measured by employment, L: v = BL*®~' (B > 0, B > 1). A similar relation 
holds for the average wage, w: w = AL*“' (A > 0, 1 <a < B), and for one-man 
firms (OMF),A =B. The size distribution of firms follows the Pareto law, with 
N as the number of firms: N = RL~? (R > 0, p > 1). (In these three equations, 
both capital and Greek letters are statistical parameters.) 

An ingenious algebraic development (pp. 297, 298) combines these results 
into a theoretical labor share S,, of (9 — B)/(@ — a), derived under conditions 
Lydall calls EPM (entry and product market) competition. In EPM, di- 
minishing returns to scale need never hold as in conventional pure competi- 
tion, but freedom of entry for tiny interstitial OMF is an adequate check on 
the abuses of monopoly and oligopoly—when coupled, I should myself add, 
with plausible threats of regulation or socialization. (Lydall sees this EPM- 
OMF framework as among his major contributions.) The EPM competition is 
atomistic but imperfect. It permits, in particular, imperfections in knowledge, 
unutilized economies of scale, and heterogeneous capital. It is consistent even 
in the long run, Lydall believes, with such phenomena as markup pricing, 
production under increasing returns, and persistent differentials in rates of 
return on investments of different sorts. As to the importance of these 
modifications, my reaction is wait and see, influenced more by the post- 
Chamberlin history of the “large group case” of monopolistic competition 
than by any analytical flaw in EPM-OMF or its applications. 

Two further innovations in Lydall’s approach and results relate to the 
dynamics of distribution and development. They explore the Kuznets con- 
jecture that, in Lydall’s words (pp. 129, 130), “the degree of inequality is fairly 
low in the poorest countries, increases with per capita income up to a certain 
level, and thereafter follows a continuous downward trend.” In a cross- 
country study, a Lydall student (T. Das) derived an empirical estimate of the 
determinants of the Gini concentration ratio, G, with dependent variables X 
(income per capita), its reciprocal, and four D; or “dummy” variables:'G = 
0.784 — 0.050 log X — 12.184X~" + 0.051D, + 0.124D, — 0.033D, — 0.007D,. 


1 T interpret D, as the proportion of income recipients (to total population), D, as the 
proportion of economically active persons, D; as the proportion in the rural (or 
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Lydall provides later (pp. 219, 220) his own explanation of the Kuznets 
phenomenon. He has divided each economy under study into a traditional 
sector, 1, and a modern sector, 2. The average income, Y;, of each sector is 
related logarithmically to overall average income, Y: log Y, = 1.457 + 0.725 
log Y, log Y, = 3.86 + 0.55 log Y; we also have Y = (1 — p)Y, + pYo, with the 
proportion p rising as development progresses. Lydall assumes income distrib- 
uted lognormally in each sector with a uniform logarithmic standard devia- 
tion. From these results and assumptions he derives by simulation (his table 
12.1) numerical results which not only conform to the Kuznets hypothesis but 
go beyond it in one important respect. The share of the lowest 20 percent of 
the population eventually turns up as development proceeds, which it did not 
appear to do in Kuznets’s original data. 

The most noteworthy feature of Lydall’s discussion of personal income 
distribution (chaps. 12-14) is its conscious endeavor to redress the balance 
between competitors already in the field. As economists know, the human- 
capital theories of T. W. Schultz, Gary Becker, and others have risen to 
dominance over the past quarter century. Lydall represents a reasoned and 
good-natured rebuttal in the ongoing debate. At the same time, he avoids 
what I should call the major error of Jan Tinbergen’s 1975 distribution 
monograph, which is to ignore nonlabor incomes entirely as atavistic residuals 
destined to depart the scene in the near future. 

Postwar distribution theory has been marked by a number of efforts to 
bridge the gaps and construct at least pontoon bridges between deductive 
functional theories and empirical personal ones. These efforts have ranged in 
America alone from the “human-capital” approach mentioned above to 
“job-ladder” and “labor-market segmentation” approaches generally more 
critical of the status quo. Lydall’s study is a major and possibly a highly 
influential addition to this literature, although Lydall’s own summary of his 
innovative conclusions (chap. 15, esp. pp. 285-91) is occasionally less than 
generous to his numerous predecessors. If Lydall’s book is intended as more 
than a contribution, but as the late twentieth-century equivalent of John Bates 
Clark’s late nineteenth-century magnum opus (as that summary chapter 
hints), I fear that it falls sormewhat short. 


MARTIN BRONFENBRENNER 
Duke University 


agricultural) sector, and D, as the proportion in the urban (or nonagricultural) sector. 
(The statistical significance of D; and D, appears dubious in Lydall’s text.) The R? value 
of 0.38, while low, is significantly greater than zero. 
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